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Fatty Acid 9 -Hydroperoxide Lyase Nucleic Acid 

Seqp.ences 

5 

XNTRODUCTZON 

Technical Field 

This invention relates to the application of genetic 
10 engineering techniques to plants. More specifically, the 

invention relates to plant hydroperoxide lyase sequences cund 
methods for the use of such sequences . 



Background 

15 With the development of genetic engineering techniques, 

it is now possible to transfer genes from a variety of 
organisms into the genome of a large nimber of different plant 
species. This process has many advantages over plant breeding 
techniques, as genes may now be transferred from one pleuit 

20 species to another plant species, rather than simply from a 

plant to the same, or different, but closely related, species. 

Degradation of polyurisaturated fatty acids starts by the 
oxygenation at cis-cis doiible bonds of poly\insat\ira'ted fatty 
acids. This reaction is catalyzed by lipoxygenase (EC 

25 1.13.11.12) enzymes which are present in '^I'^ts, animals and 
microorganisms. The oxygenated prdducts, * called faitty acid 
hydroperoxides , are precursors for many ihrportant hormones 
(e.g. lipoxins, jasmonic acid/ traumiatic acid) aiid 
flavor/ fragrance molecules (e.g. cis^3-hexenol, l-octen-3-ol) 

30 in plants. 

Compounds, such as jasmonic acid, are produced from 
hydroperoxides, such as 13-hydroperoxylinolenic acid, via an 
allene oxide synthase (referred to as AOS) and an allene oxide 
cyclase (referred to as ACS) --dependent pathway. Jasmonic acid 

35 is involved in stress cind disease resistance signaling 
responses via the octadecanoid pathway. 13— 

hydroperoxylinolenic can also be cataboli zed by per oxygenases 
to form cutin monomers. Alternatively, 13 -hydrbperoxyliholenic 
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can be catabolyzed by hydroperoxide lyase eventually forming 
volatile aldehydes and traiimatic acid. 

Fatty acid hydroperoxide lyase (HPO lyase) catalyzes the 
cleavage of carbon-carbon bonds in polyunsaturated fatty acid 
5 hydroperoxides to produce short-chain aldehydes and (D-oxo- 
acids (Vick, et al. (1976) Plant Physiol. 57:780-788). The 
products of lysis of fatty acid hydroperoxides, such as short- 
chain volatile aldehydes are common in plant species. The 
aromas /flavors produced are a function of the specific 

10 products produced by the lysis of specific fatty acid 

hydroperoxides. For example, some short-chain volatile 
aldehydes contribute to the green notes'' in a wide variety of 
plant leaves, vegetables and fruits. '^Green notes are 
volatile molecules that contribute to the organoleptic 

15 qualities of flavor and fragrance of edible plant tissues. 
These qualities are often referred to as grassy, or ''green'" 
characteristics. Other short-chain volatile aldehydes, such 
as (3Z, 6Z) -nonadienol produced by the lysis of fatty acid 9- 
hydroperoxide by a fatty acid 9 -hydroperoxide lyase {9-HPO 

20 lyase or 9-HPOL) , contribute a melon aroma and/or a melon 
flavor, or sometimes referred to as a ''melon" or 'Afresh*' 
characteristic, to fruits and vegetables. Such 
characteristics are important to industries concerned with 
fragrances and flavorings . 

25 Furthermore, short- chain aldehydes are also thought to be 

involved in disease resistance. For example. Croft, et al 
((1993) Plant Physiol. 101:13-24) recently reported that {3Z)- 
hexenol and (2S)-hexenal levels increased during a 
hypersensitive-response in kidney bean plants; In addition, 

30 they also demonstrated that (2^) -hexenal is an effective 
sintibacterial agent. 

The characterization of hydroperoxide (also referred to 
as HPO lyase or HPOL) is useful for the further study of plant 
fatty acid metabolism systems and for the development of 

35 transgenic plant with increased organoleptic properties, 

including aromas and flavors. Studies of plant mechanisms may 
provide means to further enhance, control, modify, or 
otherwise alter the organoleptic qualities of edible plant 
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tissues. Furthermore, the elucidation of the physiological 
roles of HPO lyase and it's products may be useful for the 
further study of disease resistance responses, such as the HR 
response. Of particular interest are the nucleic acid 
5 sequences of genes encoding proteins which may be useful for 
applications in genetic engineering. 

Relevant Literature 

Shibata, et al . (1995) Plant Cell Physiol 36:147-156 
10 reports the purification of a HPO lyase from green bell 

pepper- Matsui, et ai . (1996) FEES Letters 394:21-24 reports 
the cloning of a HPO lyase from bell pepper and the protein 
encoded by the HPO lyase sequence resembles a cytochrome P450 . 

15 SUMMARY OF THE INVENTXON 

The present invention provides nucleic acid sequences 
encoding for proteins which catalyze the formation of short- 
chain aldehydes and oxo-acids from 9 -hydroperoxide fatty 
acids. Such proteins are referred to herein as 9- 
20 hydroperoxide lyases or 9 -HPO lyase. 

By this invention, nucleic acid sequences encoding plant 
9 -HPO lyase may now be characterized with respect to enzyme 
activity. 

. ^ ' Thus, this invention encompasses 9 -HPO lyase nucleic acid 
25 .sequences and the corresponding amino acid sequences , and; the 

use of these nucleic acid sequences in the preparation of 

oligonucleotides containing 9-HPO lyase encoding sequences for 

ana:iysis and recovery of plant 9-HPO lyase gene sequences. 

The 9-HPO lyase encoding sequence may encode a complete or 
30 partial sequence depending upon the intended use. All or a 

portion of the genomic sequence, or cDNA sequence, is 

•intended . 

' ■: Of special interest are recombinant DNA constructs which 
provide for transcription or transcription and translation 
35 (expression) of the 9-HPO lyase sequence in a host cell. In 
particular,; constructs which are capable of transcription .or 
transcription and trcuislation in plant host cells are 
preferred. For some applications a reduction in 9-HPO lyase 
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may be desired. Thus, recombinant constructs may be designed 
having the plant 9-HPO lyase sequences in a reverse 
orientation for expression of an anti-sense sequence or use of 
co-suppression, also known as - transwitch" , constructs may be 
useful. Such constructs may contain a variety of regulatory 
regions including transcriptional initiation regions obtained 
from genes preferentially expressed in plant seed tissue. For 
some uses, it may be desired to use the transcriptional and 
translational initiation regions of the 9-HPO lyase gene 
either with the 9-HPO lyase encoding sequence or to direct the 
transcription and translation of a heterologous sequence. 

In yet a different aspect, this invention relates to a 
method for producing a 9-HPO lyase in a host cell or progeny 
thereof via the expression of a construct in the cell. Cells 
containing the 9-HPO lyase as a result of the production of 
the 9-HPO lyase encoding sequence are also contemplated 
herein . 

In addition, methods for modifying the volatile aldehyde 
content in plant tissues, especially fruit and leaf tissue, as 
well as methods for producing transgenic plants with increased 
disease resistance are contemplated. 

Also considered in this invention are the modified 
plants, tissues and aldehyde compositions obtained by 
expression, of the 9-HPO lyase sequence and protein of this 
25 invention. , : 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the complete nucleotide sequence of the 
Arabidopsis HPO lyase. 
50 , Figure 2 shows a comparison of the amino acid sequences 
of the bell , pepper HPO lyase and the Arabidopsis HPO lyase- 
Mke sequence ; 

■■■ ' Figure 3 shows a comparison of the amino acid sequences 
of the Arabidopsis allene oxide synthase and the Arajbidqpsis 
35 HPO lyase- like sequence. 

Figure -4 shows the complete nucleotide sequence of the 
tomato HPO. lyase. 



20 
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Figure 5 shows the complete nucleotide sequence of the 
cucumber allene oxide synthase - 

Figure 6 shows the complete nucleotide, sequence of the 
cucumber 9 -Hydroperoxide Lyase. 
5 Figure 7 shows the amino acid sequence alignment between 

the bell pepper, banana, and Arabidopsis HPO lyase, with the 
highly conserved peptide sequences highlighted. 

Figure 8 Provides the percent similarity in the upper 
right corner and the percent divergence in the lower right 
10 corner for the nucleotide sequences (Figure 8A) and amino acid 
sequences (Figure 8A) of the bell pepper HPOL (CaHPOL) , tomato 
fruit HPOL (LeHPOL) , cucumber hypocotyl HPOL (CsC17HP0L, 
pseudogene) , Arabidopsis inf lorecence HPOL (AtHPOL) , banana 
leaf HPOL ( Ms HPOL ), cucumber hypocotyl 9 -HPOL (CslBHPOL) , 
15 Guayule AOS (GuAOS) , flaxseed AOS (LiAOS) , and the Arabidopsis 
AOS (AtAOS) . 

Figure 9 shows the gas chromatography (GC) analysis of 
the cuciunber 9-HPO lyase using linoleic acid 13 -hydroperoxide 
(Figure 9A) and linoleic acid 9- hydroperoxide (figure 9B) 
20 substrates. 

Figure 10 provides the results of the spectrophotometric 
assay of the: cucumber 9 -HPO lyase expresised from E. col i using 
linoleic acid 13 -hydroperoxide and linoleic acid 9- 
hydroperoxide substrates.. 

25 - ^ • ■ 

DETAILED DESCRXPTION OF THE INVENTXON 

In accprdcince with the subject invention, nucleotide 
sequences are provided which are capable of coding sequences 
of amino acids, such as, a protein, polypeptide or peptide, 

30 which demonstrate the ability to form short-chain aldehydes 
and oxo-acids from 9 -hydroperoxide fatty acids under plant 
enzyme reactive conditions. By V^enzyme reactive conditions" 
is meant that any necessary conditions are available in an 
environment (i.e., such factors as temperature, pH, lack of 

35 inhibiting sxibstances) which will permit the enzyme to 
function . . * : 

In one embodiments of the present invention , nucleic acid 
sequences are provided which encode for 9 -hydroperoxide lyase 
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(hereinafter referred to as 9-HPO lyase) . Such nucleic acid 
sequences are demonstrated herein to encode enzymes which form 
(3Z, 6Z) -nonadienal and 9-oxo-nonanoic acid from Linolenic acid 
9 -hydroperoxide or 9-hydroperoxy- (lOE, 12Z, 15Z)- 
5 octadecadienoic acid. 

In the examples provided below, a nucleic acid sequence 
from cucumber (Cucumis sativus) is identified from cDNA 
libraries made from total RNA isolated from cucumber 
hypocotyls, A full length coding sequence is obtained, and 

10 the product encoded by the full length sequence demonstrates 
activity towards the substrate linolenic acid 9 -hydroperoxide 
to produce (3Z, 6Z) -nonadienal and 9-oxo-nonanoic acid. 

In another embodiment of the present invention, 
e^qpression constructs are provided which direct the expression 

15 of nucleic acid sequences encoding 9-HPO lyase in bacterial 
and plant tissues. 

Of particular interest in the present invention, is the 
use of such es^ression constructs to produce transgenic plants 
with increased production of short-chain volatile aldehydes in 

20 plant fruits and tissues. Such volatile aldehydes are 
important constituents of the characteristic flavors of 
fruits, vegetables euid green leaves. Thus, the 9-HPO lyase 
sec[uence of the present invention may be used in expression 
constructs to produce transgenic plcuats with improved 

25 organoleptic properties, in particular improved fruity (or 
*melon") note flavor and aroma characteristics. 

The nucleic acid sequences of the present invention may 
also find use in e^^ression constructs for the production of 
transgenic plants with increased resistance to various 

30 pathogens. Transgenic plants expressing the HPO lyase 

sequence of the present invention may exhibit an enhanced 
hypersensitive-reaction (HR response) in response to pathogen 
attack due to the increased production of aldehydes involved 
in the HR response, such as (3Z)-hexenal and (2S)-hexenal 

35 (Croft, et al. (1993) Plant Physiol. 101:13-24). Aldehydes, 
such as (2S) -hexenal; have also been shown to be effective 
ant i -bacterial agents > further contributing to enhanced 
disease resistance (Croft, et al. (1993) , supra) . 
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Furthermore, these compounds may be involved in a general 
woiinding response in plants. 

Also of particular interest in the present invention is 
the use of 9-HPO lyase nucleic acid sequences in constructs to 
5 direct the expression of 9-HPO lyase in a prokaryotic and/or 
eukaryotic host cells for the production of flavorings and 
aromas . 

To confirm the activity and specificity of proteins 
encoded by nucleic acid sequences as 9-HPO lyase enzymes, 

10 assays are performed on protein extracts of cultured bacterial 
cells expressing the protein. As described in more detail in 
the examples, cuciimber 9-HPO lyase expression constructs are 
prepared to direct the expression of the 9-HPO lyase sequence 
in E coli. The expressed 9-HPO lyase enzymes demonstrate the 

15 ability to produce (3Z, 6Z) -nonoadienal and 9-oxo-noncaioic 
acid from linolenic acid 9 -hydroperoxide or 9-hydroperoxy- 
(lOE, 12Z, 15Z) -octadecadienoic acid by gc analysis. 

The skilled artisan will recognize that several methods 
for the identification of additional sequences encoding ,9 r-HPO 

20 lyase are available in the art. 

For example, antibody preparations, nucleic acid probes 
(DNA and RNA) and the like may be prepared and used to screen 
and recover "homologous" or "related" HPO lyase sequences from 
a variety of plant sources. Typically, nucleic acid probes 

25 are labeled to allow detection, preferably with radioactivity 
although enzymes or other methods may also be used. For 
dLmmiano logical screening methods, antibody preparations either 
monoclonal or polyclonal may be utilized. Polyclonal 
antibodies, although less specific, typically are more useful 

30 in gene isolation. For detection, the antibody is labeled 
using radioactivity or any one of a variety of second 
antibody /enzyme conjugate systems that are commercially 
available. Examples of some of the available antibody 
detection systems are described by Oberf ilder (Focus (1989) 

35 BRL Life Technol . , Inc . , 11 : 1-5) . 

In order to obtain additional HPO lyase sequences, a 
genomic or other appropriate library prepared from the 
candidate plant source of interest may be probed with 
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conserved sequences from one or more plant HPO lyase (s) to 
identify homologously related sequences. Positive clones may 
be analyzed by restriction enzyme digestion and/or sequencing. 
When a genomic library is used, one or more sequences may be 
identified providing both the coding region, as well as the 
transcriptional regulatory elements of the HPO lyase gene from 
such plant source. Probes can also be considerably shorter 
than the entire sequence. Oligonucleotides may be used, for 
example, but should be at least about 10, preferably at least 
about 15, more preferably at least 20 nucleotides in length. 
When shorter length regions are used for comparison, a higher 
degree of sequence identity is required than for longer 
sequences. Shorter probes are often particularly useful for 
polymerase chain reactions (PGR) , especially when highly 
15 conserved sequences can be identified. (See, Gould, efc al.. 
PNAS USA (1989) 55:1934-1938.) 

When longer nucleic acid fragments are employed (>100 bp) 
as probes, especially when using complete or large cDNA 
sequences, one can still screen with moderately high 
stringencies (for example using 50% formamide at 37^0 with 
minimal washing) in order to obtain signal from the target 
saiople with 20-50% deviation, i.e., homologous sequences . • 
(For additional information regarding screening techniques see 
Beltz, et al., Meth. Enzymology (1983) 100:266-285). 

Homologous sequences are foxind when there is an identity 
of sequence and may be determined upon comparison of sequence 
information, nucleic acid or amino acid, or through 
hybridization reactions between a known HPO lyase and a 
candidate source. Conservative changes, such as Glu/Asp, 
Val/Ile, Ser/Thr, Arg/Lys and Gln/Asn may also be considered 
in determining sequence homology. Typically, a lengthy 
nucleic acid sequence may show as little as 50-60% sequence 
identity, and more preferably at least about 70% sequence 
identity, between the target sequence and the given plant HPO 
35 lyase of interest excluding any deletions which may be 
present, and still be considered related. Amino acid 
sequences are considered homologous by as little as 20% 
sequence identity between the two complete mature proteins. 



20 
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(See generally, Doolittle, OF URFS and ORES (University 

Science Books, CA, 1986.) 

In addition, not only can sequences provided herein be 
used to identify homologous 9 -hydroperoxide lyases, but the 
5 resulting sequences obtained therefrom may also provide a 

further method to obtain plant 9 -hydroperoxide lyase sequences 
from other plant sources. In particular, PGR may be a useful 
technique to obtain related 9-HPO lyases from sequence data 
provided herein. One skilled in the art will be able to 

10 design oligonucleotide probes based upon sequence comparisons 
or regions of typically highly conserved sequence. 
Furthermore, additional sequences may be added to the 5' end 
of the primer to include restriction endonuclease cleavage 
sites to allow for convenient cloning of the PGR fragment into 

15 cloning vectors. Addition of such 5' ^tails' is well known in 
the art, and do not interfere with amplification as mismatched 
nucleotides at the 5' end of primers are not required for 
amplification. 

For example, as demonstrated in the examples that follow, 

20 amino acid sequence alignments between sequences encoding 13- 
hydroperbxy lyase from bell pepper (Matsui, et al. (1996) FEBS 
Iietters 394:21-24) , banana (European Patent Application, 
Ptiblication Number EP 0 801 133 A2, the entirety of which is 
incorporated herein by reference) and Arabldopsis identify 

■25 highly conserved peptide sequences • Synthetic oligonucleotide 
primers are constructed based on these peptide sequences and : 
used in PGR reactions to amplify cDNAs from several plant 
tissue sources which are highly similar to known HPO lyase 
sequences. Gomplimentary DNA sequences are identified from 

30 tomato hypocotyl and fruit tissues and cucumber hypocotyl 

tissue which are highly homologous to the bell pepper 13-HPO 
lyase. However, a second cDNA sequence is obtained from the 
cucximber hypocotyl tissue which is divergent from the 13-HPO 
lyase sequences. Furthermore, the sequence shows a slightly 

35 higher similarity to allene oxide synthases. 

The nucleotide sequence obtained from any method may be 
molecular ly cloned into an appropriate vector for propagation 
of the DNA by methods known in the art. Many cloning vectors 
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are available commercially, and may be employed used with the 
nucleotide sequences of the present invention. For example, 
the DNA may be inserted into a pBluescript vector (Stratgene, 
La Jolla, CA) . Typically, useful cloning vectors for 
5 bacterial use can comprise a selectable marker and a bacterial 
origin of replication derived from commercially available 
plasmids which are in turn derived from the well known cloning 
vector PBR322 (ATCC 37017). Further examples of cloning 
vectors include the pGEM vectors (Promega Biotec, Madison, 
10 WI) . In addition, linear cloning vectors with single thymine 
("T") overhangs are available for convenient cloning of PGR 
fragments amplified utilizing Taq DNA polymerase which usually 
adds an adenine (»A") to the end of the amplified fragment. 
Such vectors include pCR2 . 1 vector (Invitrogen, La Jolla, CA) . 

Cloned DNA sequences may be expressed in hosts to confirm 
HPO lyase activity. Techniques and expression vectors for 
expression in such hosts are well known in the art and are 
available from a wide variety of commercial sources. Exaii«>les 
of such vectors include PQE30 (Qiagen, Hilden, Germany). 
Furthermore, the sequences of the DNA coding regions can be 
optimized by gene resynthesis, based on codon usage, for 
maximum expression in particular hosts. 

As described in more detail in the examples that follow, 
expression of the cucumber HPO lyase sequence in E coli 
demonstrates that the sequence encodes a 9 -hydroperoxide 
lyase. This is the first reported sequence cloned, to the 
inventor's knowledge, encoding a hydroperoxide lyase with 
activity on linolenic acid 9 -hydroperoxide substrates. 

Once the nucleic acid sequence is obtained, the 
transcription, or transcription and translation (expression), 
of the 9-HPO lyase in' a host cell is desired to produce a 
ready source of the enzyme and/or modify the composition of 
fatty acids and/or volatile compound foumd therein. Other 
useful applications may be found when the host cell is a plant 
35 host cell, in vitro and in vivo. 

Nucleic: acids' (genomic DNA, plasmid DNA, cDNA, synthetic 
DNA, mRNAv etc.) encoding HPO lyase or amino acid sequences of 
the spuri fled enzymes, which permit design of nucleic acid 
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probes facilitating the isolation of DNA coding sequences 
therefor, are known in the art and are available for use in 
the methods of the present invention. It is generally 
recognized to an artisan skilled in the field to which the 
5 present invention pertains that the nucleic acid sequences 

provided herein and the amino acid sequences derived therefrom 
may be used to isolate other potential HPO lyase genes from 
GenBank or other database using DNA and peptide search 
techniques generally known in the art. 
10 In addition to the sequences described in the present 

invention, DNA coding sequences useful in the present 
invention can be derived from algae, fungi, bacteria, 
mammalian sources, plants, etc. Homology searches in existing 
databases using signature sequences corresponding to conserved 
15 nucleotide and amino acid sequences of 9-HPO lyase can be 
en^loyed to isolate equivalent, related genes from other 
sources such as plants and microorganisms . Searches in EST 
datcJDases can also be employed. Furthermore^ the use of DNA 
sequences encoding enzymes functionally enzymatically 
20 equivalent to those disclosed herein, wherein such DNA 

sequences are degenerate equivalents of the nucleic acid 
sequences disclosed herein in accordance with the degeneracy 
of the genetic code, is also encompassed by the present 
invention i Demonstration of the functionality of coding 
25 sequences identified by cuiy of these methods can be carried 
out by expression of such sequences in various expression 
systems employing various organisms. 

Furthermore, constructs for the expression of the 9-HPO 
lyase sequences in plants are described. Also, the use of 
30 such 9-HPO lyase expression constiructs for increasing aldehyde 
and oxo-acid production in plant tissues is contemplated in 
the present invention. The 9-HPO lyase expression constructs 
may be employed to produce a host plant having modified 
organoleptic properties. By organoleptic properties is meant 
35 as relating to qualities, such as taste, color, odor, and 

feel, of a substance; such as a food, that stimulate the sense 
organs! Furthermore, Constructs may be prepared to direct the 
expression of = the 9-HPO lyase sequence in a host cell to 
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provide for the production of aldehydes and oxo-acids for use 
in flavorings and fragrances. 

The nucleic acid sequences which encode 9 -hydroperoxide 
lyase may be used in various constructs, for example, as 
probes to obtain further sequences. Alternatively, these 
sequences may be used in conjunction with appropriate 
regulatory sequences to increase levels of the respective 9- 
HPO lyase of interest in a host cell for recovery or study of 
the enzyme in vitro or in vivo or to decrease levels of the 
respective 9-HPO lyase of interest for some applications when 
the host cell is a plant entity, including plant cells, plant 
parts (including but not limited to seeds, cuttings or 
tissues) cuid plants. 

Thus, depending upon the intended use, the constructs may 
15 contain the nucleic acid sequence which encodes the entire 9- 
HPO lyase protein, or a portion thereof. For example, where 
antisense inhibition of a given 9-HPO lyase protein is 
desired, the entire 9-HPO lyase sequence Is not required. 
Furthermore, where 9-HPO lyase constructs are intended for use 
as probes, it may be advantageous to prepare constructs 
containing only a particular portion of a 9-HPO lyase encoding 
sequence, for example a sequence which, is discovered to encode 
a highly conserved 9-HPO lyase region. 

As discussed above, nucleic acid sequence encoding a 
25 plant or other 9-HPO lyase of this invention may include 

genomic, cDNA or mRNA sequence. By "encoding- is meant that 
the sequence corresponds to a particular amino acid sequence 
either in a sense or anti-sense orientation. By 
"extrachromosomal- is meant that the sequence is outside of 
30 the plant genome of which it is naturally associated. By 
"recombinant" is meant that the sequence contains a 
genetically engineered modification through manipulation via 
mutagenesis, restriction enzymes, and the like. 

A CDNA sequence may or may not contain pre-processing 
sequences, such as transit peptide sequences or targeting 
sequences to facilitate delivery of the 9rHP0 lyase protein 
to a given organelle or membrane, location. ; The use of any 
such precursor 9-HPO lyase DNA sequences is .preferred for uses 
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in plant cell expression. A genomic 9-HPO lyase sequence may 
contain the transcription and translation initiation regions, 
introns, and/or transcript termination regions of the plant 9- 
HPO lyase, which sequences may be used in a variety of DNA 
. 5 constructs, with or without the 9-HPO lyase structural gene. 
Thus, nucleic acid sequences corresponding to the 9-HPO lyase 
of this invention may also provide signal sequences useful to 
direct protein delivery into a particular organellar or 
membrane location, 5' upstream non-coding regulatory regions 

10 (promoters) having useful tissue and timing profiles, 3' 
downstream non-coding regulatory regions useful as 
transcriptional and translational regulatory regions, and may 
lend insight into other features of the gene. 

Once the desired plant or other 9-HPO lyase nucleic acid 

15 sequence is obtained, it may be meuaipulated in a variety of 
ways . Where the sequence involves non-coding flanking 
regions, the flanking regions may be sxabjected to resection, 
mutagenesis, etc. Thus, transitions, transversions , 
deletions, and insertions may be performed on -the naturally 

20 occurring sequence; In addition, all or part of the sequence 
may be synthesized. In the structural gene* one or more 
codons may be modified to provide for a modified, amino acid 
sequence, or one or more codon mutations may be introduced to 
provide for a convenient restriction, site or other purpose 

25 involved with construction or expression. The structural gene 
may be further modified by employing synthetic adapters, 
linkers to introduce one or more convenient restriction sites, 
or the like. 

The nucleic acid or amino acid sequences encoding a plant 
30 or other 9-HPO lyase of this invention may be combined with 

other non-native, or "heterologous", sequences: in a variety of 
ways. By "heterologous" sequences is meant any sequence which 
is not naturally found joined to the native (or wild- type) 9- 
HPO lyase, including, for example, combinations of nucleic 
35 acid sequences from the same plemt which are not naturally 
found joined together. ^ 

The DNA sequence encoding a plant .or } other> 9rHPO. lyase of 
this invention may be employed in conjunction with all or part 
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Of the gene sequences normally associated with the 9-HPO 
lyase. in xts component parts, a DNA sequence encoding 9-HPO 
lyase xs combined in a DNA construct having, in the 5' to 3' 
direction of transcription, a transcription initiation control 
regxon capable of promoting transcription and translation in a 
host cell, the DNA sequence encoding plant 9-HPO lyase and a 
transcription and translation termination region 

Potential host cells include both prokaryotic and 
eukaryotic cells, a host cell may be unicellular or found in 
a multxcellar differentiated or undifferentiated organism 
depending upon the intended use. Cells of this invention may 
be dxstxnguished by having a 9-HPO lyase foreign to the wild- 
type cell present therein, for example, by having a 
recombinant nucleic acid construct encoding a 9-HPO lyase 
15 therein not native to the host species 

Prokaryotic cells include gram negative as well as gram 
posxtxve bacteria, for example E. coli, and B. subtilis 
straxns. Suitable examples are well known to the skilled 

20 TaT^' ^"^^^^^^ — <^etail in the examples that 

20 follow, an HPO lyase isolated from cucumber hypocotyl is 

e^ressed in E coli, strain M15. protein expressed from 

the E colx xs capable of producing the aldehyde 3 (Z,-nonenal 
and 2(E)-nonenal from linoleic acid 9 -hydroperoxide. -Phus 

25 9 "p^lyar^ '^"^'^ '^^^ ^^^^^^ ^ -<^-'a 

Eukaryotic host cells include fungi, including yeasts 
xnsect cells, and plant cells. Methods for the expression' of 
sequences of interest in yeast cells are known in the art 
and are generally described in "Guide to yeast genetics and 
molecular bxology-. Guthrie and Fink, eds. Methods in 
enzymology , Academic Press, inc. Vol 194 (1991, and Gene 
expression technology-, Goeddel ed. Methods in Enzymology 
Academxc Press, Inc., vol 185 (1991). m addition, methods 
for the expression of 13-hpo lyase genes are described in 
European patent Application EP 0 801 133 A2. the entirety of 
whxch xs incorporated herein by reference 

^ ^ The fungal recombinant vector may be "any vector which can 
be. convenxently subjected to recombinant DNA procedures, 
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choice of a vector will typically depend on the compatibility 
of the vector with the fungal host cell into which the vector 
is to be introduced. The vector may be a linear or a closed 
circular plasmid. The vector system may be a single vector or 
5 plasmid or two or more vectors or plasmids which together 

contain the total DNA to be introduced into the genome of the 
fungal host. 

The fungal vector may be an autonomously replicating 
vector, i.e., a vector which exists as an extrachromosomal 

10 entity, the replication of which is independent of chromosomal 
replication, e.g., a plasmid, an extrachromosomal element, a 
minichromosome, or an artificial chromosome. The vector may 
contain any means for assuring self -replication. 
Alternatively, the vector may be one which, when introduced 

15 into the fungal cell, is integrated into the genome and 

replicated together with the chromosome ( s ) into which it has 
been integrated. For integration, the vector may rely on the 
nucleic acid sequence of the vector for stable integration of 
the vector into the genome by homologous or nonhomologous 

20 recombination. Alternatively, the vector may contain 

additional nucleic acid sequences for directing integration by 
homologous recombination into the genome of the fungal host. 
The additional nucleic acid sequences enable the vector to be 
integrated into the host cell genome at a precise location (s) 
,25 in the chromosome (s) . To increase the likelihood of 

integration at a precise location/ there should be preferably 
two nucleic acid sequences which individually contain a 
sufficient nximber of nucleic acids, preferably 400bp to 
1500bp, more preferably 800bp to lOOObp, which are highly 

30 homologous with the corresponding target sequence to enhance 
the probability of homologous recombination. These nucleic 
acid sequences may be any sequence that is homologous with a 
target sequence in the genome of the fungal host cell and, 
furthermore, may be non-encoding or encoding sequences. 

35 For autonomous replication, the vector may further 

comprise an origin of replication enabling the vector to 
replicate autonomously in the host cell in question. Examples 
of ' origin of replications for use in a yeast host cell are the 
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2 micron origin of replication and the combination of CEN3 and 
ARS 1. Any origin of replication may be used which is 
compatible with the fungal host cell of choice. 

The fungal vectors of the present invention preferably 
contain one or more selectable markers which permit easy 
selection of transformed cells, a selectable marker is a gene 
the product Of which provides, for example biocide or viral 
resistance, resistance to heavy metals, prototrophy to 
auxotrophs and the like. The selectable marker may be 
selected from the group including, but not limited to, aindS 
(acetamidase) , argB (ornithine carbamoyltransf erase) , bar 
(Phosphinothricin acetyltransf erase) , hygB (hygromycin 
phosphotransferase), niaD (nitrate reductase), pyrG 
(orotidine-5' -phosphate decarboxylase) and sC (sulfate 
adenyltransferase) and trpC (anthranilate synthase) 
Preferred for use in an Aspergillus cell are the amd5 and pyrG 
markers of Aspergillus niduians or Aspergillus oryzae and the 
bar marker of Streptomyaes hygrosaopicus . Furthermore, 
selection may be accomplished by co- trans formation, e.g.. as 
described in WO 91/17243, the entirety of which is herein 
incorporated by reference. A nucleic acid sequence of the 
present invention may be operably linked to a suitable 
promoter sequence. The promoter sequence is a nucleic acid 
sequence which is recognized by the fungal host cell for 
expression of the nucleic acid sequence. The promoter 
sequence contains transcription and translation control 
sequences which mediate the expression of the protein or 
fragment thereof. 

A promoter may be any nucleic acid sequence which shows 
transcriptional activity in the fungal host cell of choice and 
inay be obtained from genes encoding polypeptides either 
homologous or heterologous to the host cell. Examples of 
suitable promoters for directing the transcription of a 
nucleic acid construct of the invention in a filamentous 
fungal host are promoters obtained from the genes encoding 
Aspergillus oryzae TAKA amylase, Ehizomucor miehei aspartic 
proteinase, Aspergillus niger neutral alpha-amylase 
Aspergillus niger acid stable alpha-amylase, Aspergillus niger 
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or Asperffillus awamori glucoamylase (glaA) , Rhizomucor miehel 
lipase, Aspergillus oxyzae alkaline protease, Aspergillus 
oryzae triose phosphate isomerase, Aspergillus nidulans 
acetamidase and hybrids thereof. In a yeast host, a useful 
5 promoter is the Saccharomyces cerevisiae enolase (eno-1) 
promoter* Particularly preferred promoters are the TAKA 
amylase, NA2-tpi (a hybrid of the promoters from the genes 
encoding Aspergillus niger neutral alpha -amylase and 
Aspergillus oryzae triose phosphate isomerase) and glaA 

10 promoters. 

A protein or fragment thereof encoding nucleic acid 
molecule of the present invention may also be operably linked 
to a terminator sequence at its 3' terminus. The terminator 
sequence may be native to the nucleic acid sequence encoding 

15 the protein or fragment thereof or may be obtained from 

foreign sources . Any terminator which is fiinctional in the 
fungal host cell of choice may be used in the present 
invention, but particularly preferred terminators are obtained 
from the genes encoding Aspergillus oryzae TAKA amylase, 

20 Aspergillus niger glucoamylase, Aspergillus nidulans 

an thranilate synthase, Aspergillus niger alpha-glucosidase and 
SaGcharomycies cerevisiae enolase. 

^ A protein or fragment thereof encoding nucleic acid 
molecule of .the present invention may also be opereJply linked 
.25 to a suitable leader sequence. A leader sequence is a 
nontranslated region of a itiRNA which is irtqportant for 
translation by the fungal host. The leader sequence is 
operably linked to the 5' terminus of the nucleic acid 
sequence encoding the protein or fragment thereof . The leader 

30 sequence may be native to the nucleic acid sequence encoding 
the protein or fragment thereof or may be obtained from 
foreicpi sources . Any leader sequence which is f\inctional in 
the fungal host cell of choice may be used in the present 
invention, but particularly preferred leaders are obtained 

35 from the genes encoding Aspergillus oryzae TAKA amylase eind 
Aspergillus oryzae ^ triose phosphate isomerase. 

'J ' A polyadenylation sequence may also be opercQ^ly linked to 
the 3' terminus^ of the. nucleic acid sequence of the present 
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invention. The polyadenylation sequence is a sequence which 
when transcribed is recognized by the fungal host to add 
polyadenosine residues to transcribed mRNA. The 
polyadenylation sequence may be native to the nucleic acid 
sequence encoding the protein or fragment thereof or may be 
obtained from foreign sources. Any polyadenylation sequence 
which is functional in the fungal host of choice may be used 
m the present invention, but particularly preferred 
polyadenylation sequences are obtained from the genes encoding 
Aspergillus oryzae TAKA amylase, Aspergillus niger 
glucoamylase, Aspergillus nidulans anthranilate synthase and 
Aspergillus niger alpha-glucosidase. 

TO avoid the necessity of disrupting the cell to obtain 
the protein or fragment thereof and to minimize the amount of 
possible degradation of the expressed protein or fragment 
thereof within the cell, it is preferred that expression of 
the protein or fragment thereof gives rise to a product 
secreted outside the cell. To this end, a protein or fragment 
thereof of the present invention may be linked to a signal 
peptxde linked to the amino terminus of the protein or 
fragment thereof. A signal peptide is an amino acid sequence 
whxch permits the secretion of the protein or fragment thereof 
from the fungal host into the culture medium. The signal 
peptide may be native to the protein or fragment thereof of 
the invention or may be obtained from foreign sources. The 5' 
end of the coding sequence of the nucleic acid sequence of the 
present invention may inherently contain a signal peptide 
coding region naturally linked in translation reading frame 
with the segment of the coding region which encodes the 
secreted protein or fragment thereof. Alternatively, the 5- 
end of the coding sequence may contain a signal peptide coding 
region which is foreign to that portion of the coding sequence 
which encodes the secreted protein or fragment thereof. The 
foreign signal peptide may be required where the coding 
sequence does not normally contain a signal peptide coding 
region. Alternatively, the foreign signal peptide may simply 
replace the natural signal peptide to obtain enhanced 
secretion of the desired protein or fragment thereof The 
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foreign signal peptide coding region may be obtained from a 
glucoamylase or am amylase gene from an Asperg^i II us species , a 
lipase or proteinase gene from Rhi^omucor miehei, the gene for 
the alpha-factor from SaccharomycGs cerevisia.e, or the calf 
5 preprochymosin gene. An effective signal peptide for fungal 
host cells is the Aspergillus oryzae TAKA amylase signal, 
Aspergillus niger neutral amylase signal, the Rhizomucor 
miehei aspartic proteinase signal, the Humicola. lanuginosus 
cellulase signal, or the Rhizomucor miehei lipase signal. 

10 However, any signal peptide capable of permitting secretion of 
the protein or fragment thereof in a fungal host of choice may 
be used in the present invention. 

A protein or fragment thereof encoding nucleic acid 
molecule of the present invention may also be linked to a 

15 propeptide coding region. A propeptide is an amino acid 
sequence foumd at the amino terminus of aprpprotein or 
proenzyme. Cleavage of the propeptide from the proprotein 
yields a mature biochemically active protein. The resulting 
polypeptide is known as a propolypeptide or proenzyme (or a 

20 zymogen in some cases). Propolypeptides are generally 

inactive and can be converted to mature active polypeptides by 
catalytic or autocataly tic cleavage of the propeptide from the 
propolypeptide or proenzyme, .The propeptide coding region may 
be native to the protein or fragment thereof or may be 

25 obtained from foreign sources . The forei^ propeptide coding 
region may be obtained from the Sacciiarojnyces cerevisiae 
alpha-factor gene or Myceliophthora. thermophila, laccase gene 
(WO 95/33836, the entirety of which is herein incorporated by 
reference) . 

30 The procedures used to ligate, the elements described 

above to construct the recombinant expression vector of the 
present invention are well known to one skilled in the art 
(see, for example, Sambrook et al . , Molecular Cloning, A 
Laboratory Manual, 2nd ed. , Cold Spring Harbor, N.Y. , (1989)). 

35 The present ; invention also relates , to recombinant fiingal 

host cells produced by the methods of the present invention 
which are advantageously used with the recombinant vector of 
the present invention. The cell ,4-^ P^^.f^^ably transformed 
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with a vector comprising a nucleic acid sequence of the 
invention followed by integration of the vector into the host 
chromosome. The choice of fungal host cells will to a large 
extent depend upon the gene encoding the protein or fragment 
thereof and its source. The fungal host cell may, for 
example, be a yeast cell or a filamentous fungal cell. 

"Yeast" as used herein includes Ascosporogenous yeast 
{EndomycetalBs) . Basidiosporogenous yeast and yeast belonging 
to the Fungi Imperfecti (Blastomyaetes) . The Ascosporogenous 
yeasts are divided into the families SpermophthoraceaB and 
Saccharomycetaceae. The latter is comprised of four 
subfamilies, SchizosaccharomycoideaB (for example, genus 
Schizosaccharomyces) , Nadsonioideae , Lipomycoideae and 
Saccharoiaycoidoae (for example, genera Pichia, Kluy^^eromyces 
and Sacaharomyces) . The Basidiosporogenous yeasts include the 
genera Leucosporidim, Rhodosporidium, Sporidioholus , 
Filobasidivm and Filobasidiella. Yeast belonging to the Fungi 
Xmperfecfci are divided into two families, £t>oroJboIoinycetaceae 
(for example, genera SorojboJojnyces and BulJera) and 
Cryptococcaceae (for example, genus Candida) . Since the 
classification of yeast may change in the future; for the 
purposes of this invention, yeast shall be defined as 
described in Biology and Activities of Yeasf .(Skinner et al., 
Soc. App. Bacterxol. Symposium Series No . 9,- (19801 . the 
entirety of which is herein incorporated by .reference) . The 
biology of yeast and manipulation of yeast genetics are well 
known in the art (see, for example. Biochemistry and Genetics 
of Yeast, Bacil et al. (ed. ) , 2nd edition, 1987; The yeasts. 
Rose and Harrison (eds.), 2nd ed., (1987); and The Molecular 
Biology of the Yeast Saccharomyces , Strathern et al. (eds.), 
(1981) , all of which are herein incorporated by reference in 
their entirety) . 

The recombinant fimgal host cells of the present 
invention may further comprise one or more sequences which 
encode one or more factors that are advantageous in the 
expression of the protein or fragment thereof, for exaitple, an 
activator (e.g., a trans-acting factor), a chaperone and- a 
processing protease. The nucleic acids encoding one; or more 
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of these factors are preferably not operably linked to the 
nucleic acid encoding the protein or fragment thereof. An 
activator is a protein which activates transcription of a 
nucleic acid sequence encoding a polypeptide (Kudla et al . , 
5 EMBO 9:1355-1364(1990); Jarai and Buxton, Current Genetics 
26:2238-244(1994); Verdier, Yeast 6:271-297(1990), all of 
which are herein incorporated by reference in their entirety) . 
The nucleic acid sequence encoding an activator may be 
obtained from the genes encoding Saccharomyces cerevisiae heme 

10 activator protein 1 (hapl) , Saccharomyces cerevisiae galactose 
metabolizing protein 4 (gal4) and Aspergillus nidulans ammonia 
regulation protein (areA) . For further examples, see Verdier, 
yeast 5:271-297 (1990); MacKenzie et al. , Journal of Gen. 
Microbiol. 135:2295-2307 (1993), both of which are herein 

15 incorporated by reference in their entirety) . A chaperone is 
a protein which assists another protein in folding properly 
(Hartl et al., TIBS 15:20-25 (1994); Bergeron et al., TIBS 
19:124-128 (1994) ; Demolder et al., *7. Biotecimolofiry 32 : 179- 
189 (1994); Craig, Science 250:1902-1903(1993); Ge thing and 

20 Sambrook, Nature 355:33-45 (1992); Puig and Gilbert, J Biol: 
Chem. 255:7764-7771 (1994) ; Wang and Tsou, FASEB ^Journal 
7:1515-11157 (1993); Robinson et al., Bio/Tecimology 1:381-384 
(1994) , all of which are herein incorporated by reference in 
their entirety) . The nucleic acid sequence encoding a 

25 chaperone may be obtained from the genes encoding Aspergillus 
oxyzae protein disulphide isomerase/ Saccharomyces cerevisiae 
calnexin, Saccharomyces cerevisiae BiP/GRP78 and Saccharomyces 
cerevisiae Hsp70. For further examples, see Gething and 
Sambrook, Nature 355:33-45 (1992); Hartl et al., TIBS 19:20-^25 

30 (1994) . A processing protease is a protease that cleaves a 
propeptide to generate a mature biochemically active 
polypeptide (Enderlin and Ogrydziak, Yeast 10:67-79 (1994); 
Fuller et al., Proc. Natl. Acad. Sci. (U.S.A.) 85:1434-1438 
(1989); Julius et al.. Cell 37: 1075-1089 (1984) ; Julius et 

35 al.. Cell 32:839-852 (1983), all of which are incorporated by 
reference in their entirety) . The nucleic acid sequence 
encoding a processing protease may be obtained from the genes 
encoding Aspergillus niger Kex2, Saccharomyces cerevisiae 
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dipeptidylaminopeptidase, Saccharomyces aerevisiae Kex2 and 
Yarrowia lipolytiaa dibasic processing endoprotease (xpr6) 
Any factor that is functional in the fungal host cell of 
choice may be used in the present invention. 
5 Fungal cells may be transformed by a process involving 

protoplast formation, transformation of the protoplasts and 
regeneration of the cell wall in a manner known per se 
suitable procedures for transformation of Aspergillus host 
cells are described in EP 238 023 and Yelton et al Proc 
10 Natl. Acad. Sci. (U.S.A.) 81:1470-1474 (1984), both' of which 
are herein incorporated by reference in their entirety a 
suitable method of transforming Fusarium species is described 
by Malardier et aJ., Gene 78..147-156 (1989), the entirety of 
Which is herein incorporated by reference. Yeast may be 
15 transformed using the procedures described by Becker and 
Guarente, In: Abelson and Simon, (eds.). Guide to y^ast 
Genetics and Molecular Biology, Methods Enzymol . Volume 194 
PP 182-187, Academic Press. Inc., New York; Ito et al j ' 
20 f ^"''^ ^^^«3>'- Hinnen et al., Proc. Natl. Acad. 

20 rC7.S.A.; 75:1920 (1978), all of Which are herein 

incorporated by reference in their entirety. 

The present invention also relates to methods of 
producing the protein or fragment thereof comprising culturing 
---°^-nant fungal host cells unde^: conditions conducive 

cells of the present invention are cultivated in a nutrient 
medium suitable for production of the protein or fragment 
thereof using methods known in the art. For example, the cell 
may be cultivated by shake flask cultivation, small-scale or 
large-scale fermentation (including continuous, batch, fed- 
batch, or solid state fermentations) in laboratory or 
industrial fermentors performed in a suitable medium and under 
conditions allowing the protein or fragment thereof to be 
expressed and/or isolated. The cultivation takes place in a 
35 suitable nutrient medium con^rising carbon and nitrogen 

sources and inorganic salts, using procedures known in the art 
(see, e.gr., Bennett and LaSure (eds.). More Gene Manipulations 
in Funsri, Academic Press, CA, (1991), the entirety of which is 
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herein incorporated by reference) . Suitcible media are 
available from commercial suppliers or may be prepared 
according to published compositions (e.g., in catalogues of 
the American Type Culture Collection, Manassas, VA) . If the 
5 protein or fragment thereof is secreted into the nutrient 
medium, a protein or fragment thereof can be recovered 
directly from the medium. If the protein or fragment thereof 
is not secreted, it is recovered from cell lysates . 

The expressed protein or fragment thereof may be detected 

10 using methods known in the art that are specific for the 

particular protein or fragment . These detection methods may 
include the use of specific antibodies, formation of an enzyme 
product, or disappearance of an enzyme substrate. For 
example, if the protein or fragment thereof has enzymatic 

15 activity, an enzyme assay may be used. Alternatively, if 

polyclonal or monoclonal antibodies specific to the protein or 
fragment thereof are available, imm\inoassays may be employed 
using the cintibodies to the protein or fragment thereof. The 
techniques' of enzyme assay and immunoassay are well known' to 

20 those skilled in the art. 

The : resulting protein or fragment thereof may be 
recovered by. methods known in the arts. For example, the 
protein- or fragment thereof may be recovered from the nutrient 
medixam by : conventional procedures including, but not limited 

25 to,i centr:ifugation> filtration, extraction, spray-drying, 
evaporation, or precipitation. The recovered protein or 
fragment thereof may then be further purified by a variety of 
chromatographic procedures , e.g., ion exchange chromatography, 
gel filtration chromatography, affinity chromatography, or the 

30 like. 

Methods for the expression of DNA sequences of interest 
in insect host cells are also well known in the art, and are 
reviewed by Lucow and Summers , (1988) Bio/technology 6:47-55, 
the entirety of which is incorporated herein by reference. 
35 r In a preferred embodiment, plant host cells axe employed 

in :trans format ion experiments with constructs containing DNA 
sequences cocling 9-HPO lyase enzymes expressed from 
eonstitutiye ,or tissue ^ enhsmced promoters . 



BNSDOCID: <WO_00G057GA^J.> 



10 



15 



20 



25 



30 



35 



WO 00/50575 

24 PCTAJSOO/05311 
Depending upon the host, the regulatory regions will 
vary, including regions from viral, plasmid or chronoso-nal 
genes, or the like. For expression in prokaryotic or 
eukaryotio microorganisms, particularly unicellular hosts a 
^e variety of constitutive or regulatable promoters may'be 
e-rployed E:^ression in a microorganism can provide a rlady 
source of the plant enzyme. Smong transcriptional initiation 
regrons whxch have been described are regions from bacterial 
and yeast hosts, such as E. coli, B. .ubtiiis, Sacchromyces 
cerev.s.ae, rncluding genes such as beta-galactosidase. TV 
polymerase, tryptophan E and the like. 

r^.^y P-^^t^red embodiment, the constructs will involve 
l^l T "'^ factional in plants which provide for 

modxfred production of 9-HPO lyase, and. possibly 

frL'Td'" f '""^ composition. The ;pen reading 

tZeof ?f '^"^^ °' functional fragment 

thereof wrll be joined at its 5- end to a transcription 
initiation regulatory region. In embodiments wherein the 
expression of the 9-HPO lyase protein is desired in a plant 
host, the use of all or part of the con^lete plant HPO lyase 

IZIT "^^'^ °' 5- upstream non- 

coding regions (promoter) together with the structural gene 
se<;^ence and 3 - downstream non-coding regions may be e,Jloyed. 

native to t; ^^"" """"""" « P^<»>'«r 

native^to the plant host of interest or a modified promoter, 

x^e having transcription initiation regions derived from one 

aene source and translation initiation regions derived from a 

different gene source, numerous transcription initiation 

IZTI T for a Wide variety of 

constitutive or regulatable, e.g., inducible, transcription of 

inlt ^« '-"--^P^on/translation 

initiation regions corresponding to such structural genes are 
found immediately 5- upstream to the respective start oodons 
Among transcriptional initiation regions used for plants are 

Z t.TlTl^^'^'r^^ ^"^ structural genes such 

as for nopaline and mannopine synthases, the 19s and 35S 

promoters from CaMV. and the 5- upstream regions from other 
Plant genes such as napin, ACP, SSD. PG. zein, phaseolin E 
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and the like. Enhanced promoters, such as double 35S, are 
also available for expression of HPO lyase sequences. For 
such applications when 5 ' upstream non-coding regions are 
obtained from other genes regulated during seed maturation, 
5 those preferentially expressed in plant embryo tissue, such as 
ACP and napin-derived transcription initiation control 
regions, are desired. Such "seed-specific promoters" may be 
obtained and used in accordance with the teachings of issued 
U.S. Patent Numbers 5,608,152 and 5 , 530 , 194 , which references 

10 are hereby incorporated by reference. Transcription 

initiation regions which are preferentially expressed in seed 
tissue, i.e., which are \indetec table in other plant parts, are 
considered desirable for expression in order to minimize any 
disruptive or adverse effects of the gene product. 

15 Regulatory transcript termination regions may be provided 

in DNA constructs of this invention as well. Transcript 
termination regions may be provided by the DNA sequence 
encoding the plant HPO lyase or a convenient transcription 
termination region derived from a different gene source, for 

20 example, the treuciscript termination region which is naturally 
associated with the transcript initiation region. Where the 
transcript termination region is from a different gene souirce, 
it will contain at least about 0.25 kb, preferably about 1^3 
kb of sequence 3' to the structural gene from which the 

25 termination region . is derived. 

Plant expression or transcription constructs having a 9- 
HPO lyase as the DNA sequence of interest for increased or 
decreased expression thereof may be employed with a wide 
variety of plant life, particularly, plant life involved in 

30 the production of tissues for edible uses. Most especially 
preferred are seed, fruit, vegetable and leaf crops. Plants 
of interest include, but are not limited to, Brassica species, 
soybean, corn, tomato, strawberry, bell pepper and melon* 
Depending on the method for introducing the recombinant 

35 constructs into the host cell, other DNA sequences may be 
required. Importantly, this invention is applicable to 
dicotyledenous and monocotyledenous species alike and will be 
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readily applicable to new and/or improved transformation and 
regulation techniques. 

In order to increase lipid peroxidation, and thereby 
xncreasing "melon" flavors/fragrances, in a plant tissue 
coexpression of a plant or other 9-hpo lyase in a plant tissue 
wxth a second gene involved in lipid peroxidation may also 
fxnd use in the present invention. For exan^ie, coexpression 
of a 9-HPO lyase secjuence in a plant tissue with a DNA 
sequence encoding for another protein involved in lipid 
peroxidation, such as a lipoxygenase may increase lipid 
peroxidation and increase the total short-chain aldehydes 
produced in the plant tissue. Such an increase in short-chain 
tissue "^"^ increase the "melon' flavor in an- edible plant 

The method of transformation in obtaining such transgenic 
Plants xs not critical to the instant invention, and various 
methods of plant transformation are currently available 
Furthermore, as newer methods become available to transform 
crops, they may also be directly applied hereunder. For 
example, many plant species naturally susceptible to 
^Srrobacterium infection may be successfully transformed via 
trxpartxte or binary vector methods of A^robacterium mediated 
transformation. m many instances, it will be desirable to 
have the construct bordered on one or both sides by T-DNA 
particularly having the left and right borders, more 
particularly the right border. This is particularly useful 
When the construct uses A. tumefaciens or A. rhi.ogenes as ^ 
mode for transformation, although the T-dna borders may find 
use with other modes of transformation, m addition 
techniques of microinjection, DNA particle bombardment, and 
electroporation have been developed which allow for the 
transformation of various monocot and dicot plant species 

Normally, included with the DNA construct will be a 
structural gene having the necessary regulatory regions for 
expression in a host and providing for selection of 
transformant cells. The gene may provide , for resistance to a 
cytotoxic agent, e.g. antibiotic, heavy metal, toxin etc 
complementation providing prototrophy to an auxotrophic host 
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viral immunity or the like. Depending upon the number of 
different host species the expression construct or components 
thereof are introduced, one or more markers may be employed/ 
where different conditions for selection are used for the 
5 different hosts. 

Where Ag^robacterium is used for plant cell 
transformation, a vector may be used which may be introduced 
into the AgrobactGrium host for homologous recombination with 
T-DNA or the Ti- or Ri-plasmid present in the Agrrobactez-iuin 
10 host. The Ti- or Ri~plasmid containing the T-DNA for 

recombination may be armed (capable of causing gall formation) 
or disarmed (incapable of causing gall formation) , the latter 
being permissible, so long as the v±r genes are present in the 
transformed AfirroJbacterium host. The armed plasmid can give a 
15 mixture of normal plant cells and gall. 

In some instcuices where Agz-oJbactejrlum is used as the 
vehicle for transforming host plant cells, the expression or 
transcription construct bordered by the T-DNA border region (s) 
will be inserted into a broad host rauige vector capable of 
20 replication in E. coli cuid Agrr-oJbacteriuzn, there being broad 

host range vectors described in the literature. Commonly used 
is pRK2 or derivatives thereof. See, for example,; pitta ^ et 
al., {Proc. Nat. Acad. Sci., L7.S.A. (1980) 77:7347-7351) and 
EPA 0 120 515, which are incorporated herein by refierence. 
.25 Alternatively, one may insert the sequences to be expressed in 
plant cells into a vector containing separate replication 
sequences,, one of which stabilizes the vector in coli, aiid 
the other in Agrobacterxum, See, for example, McBride and 
Summerfelt {Plant Mol. Biol. (1990) 14 : 269-276) , wherein the 
30 pRiHRI (Jouanin, et al., Mol\ Gen. Genet. (1985) 201:370-374) 
origin of replication is utilized and provides for added 
stability of the plant expression vectors in host : 
Agrobacterium cells. 

Included with the expression construct and the T-DNA will 
35 be one or more markers, which allow for selection of 

transformed Agrobacterium and transformed plant cells. A 
number of markers have been developed for use with plcuit 
cells, such as resistance to chloramphenicol, kanamycin, the 
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ZillTZTf"""' - ">^e. The particular 

n-arker employed rs not essential to this invention, one or 
another marker being preferred depending on the particular 
nost and the manner of construction. 
5 transformation of plant cells using ^^rotecterium, 

explants may be combined and incubated with the transformed 
^^roiacte..u. for sufficient time for transformation, the 
bacteria killed, and the plant cells cultured in an 
appropriate selective medium. Once callus forms, shoot 

hoZr°"'" -Pl°^-g the appropriate plant 

hormones rn accordance with known methods and the shoots 
transferred to rooting medium for regeneration of plants. The 
Plants may then be grown to seed and the seed used to 
establish repetitive generations and for isolation of 
15 vegetable oils. 

the 9°^^ ^"^^^'-^ "-ts have been obtained which express 
the 9-HPO lyase encoding sequence, a number of methods are 

Zoll '"'^ °' ion and 

protern actrvxty. Methods for the analysis of 9-HPO lyase 

^t""°"' ""'"^^^ hybridizations and 

western rmmuno blot assays are generally described in 

Manratis, et al. ,1989) Afolecuiar Cloning, ^ ^.aborator^. 

«anual. Second Edition. Cold Spring Harbor I.aboratory. Cold 

Sprrng Harbor. New York. Methods for the analysis ^ 

lyase activity are described in Matsui, et al. ,1996) FBBS 

^t ers 394=21-24; Hehbock. et al. ,1998) .r.s.niu. / 

T,TtT.\Tr « al. ,1998) lipids 

33 ,5) =533-538, and in European Patent Application EP 0 801 133 

prese^r^ "^""^ depressing the 9-HPO lyase sequences of the 
! ° production of volatile 

>, ""'"^ products, such as 

those havrng fruit flavors. Such compounds and uses are 
described by Morris, (1981) Perfumer * i^lavorlst vol 6 nol 
and Clark, ,1990) Perfumer and Wavorist vol 15 the 
entireties of which are incorporated herein by reference. 



25 
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Host cells es^ressing the 9-HPO lyases of the present 
invention provide a novel source of volatile aldehydes 
contributing to **melon" notes for use in various applications. 
Furthermore, the host cells may also contain constructs 
5 providing for a increased production of enzymes involved in 
lipid peroxidation, for example lipoxygenase. In addition, 
the host cells may also produce an increased amount of a 
particular fatty acid, or have a general increase in fatty 
acids . Such host cells may be obtained using traditional 
10 breeding techniques, including mutagenesis, as well as hosts 
genetically engineered with such an altered fatty acid 
composition. 

Furthermore, plant host cells containing a construct 
providing for the expression of the 9-HPO lyase sequences of 

15 the present invention find use as a source for aldehydes in 

reactions for the production of alcohols for use in flavorings 
and aromatic products. Such methods are known in the art and 
are described for example in U.S. Patent Number 5,695,973 and 
in PCT Publication w6 95/26413 the entireties of which are 

20 incorporated herein by reference. Generally, a mixture of 
aldehydes and alcohols are obtained from such methods. The 
methods generally involve a reaction mixture containing at 
least one unsaturated fatty acid, a plant material having a 
relatively high amount of enzyme activity of lipoxygenase and 

25 hydroperoxide lyase, and a source of alcohol dehydrogenase. 

The unsaturated fatty acid may vary and include a single 
unsaturated fatty acid species as well as mixtures of several 
unsaturated fatty acids. The fatty acids are provided in a 
free acid form, and examples include, but are not limited to 

30 oleic acid, linoleic acid, linolenic acid (alpha and gamma 
forms) , arachidonic acid, eicosapentaenoic acid, and 
ricinoleic acid. 

Sources of the alcohol dehydrogenase include yeasts, as 
well as non-yeast molds. The alcohol dehydrogenase has the 

35 ability to convert an aldehyde to em alcohol. The yeast audi 
non-yeast molds further provide a source of nicotine adenine 
dinucleotide (NADH) as a reducing agent. 



BNSDOCID: <WO_0060S75AR.L> 



I 



wo 00/50575 

PCT/USOO/05311 

The invention now being generally described, it will be 
more readily understood reference to the following examples 

Z r.T"^^ """^^^^ °' illustration only and 
not intended to limit the present invention. 

5 

EXAMPI.ES 

ExaB^le 1 Identification of Arabidopsis HPO Lyase Sequences 

A nuclerc acid fragment encoding hydroperoxide lyase from 

10 Tt JTlZeT ^ ^"^^-^-^^ =^°-<^ and se^enced ,.^tsui, 
et ai. (1996) supra). The nucleotide sequence was used to 

TZT.°TT sequences. One accession 

Identified from Genbank (Accession Number Z97339 

(http://www.ncbi.nlm.nih.gov/web/Genbank/lndex.html)) 
containing a genomic sequence from ^abidopsis was reported to 
IS encode an allene oxide synthase. Ported to 

comparisons between the bell pepper HPO lyase, 
Arabidopsis allene oxide synthase (I,audert, et al (1996) 
^upra) and the ^abldopsis HPO lyase-like sequence from 
Genbank using Genetyx «ac (Software Development Co. Ltd.) 

s"l'l t T ^-^""^-^^ ^ lyase,like sequence ii more 
similar to the bell Pepper HPO lyase (57% identity, (see 

ZZl%'\ '° -11-e oxide synthase sequence (39* 

Identity) (see Figure 3).. ;, 

5 Example 2 Construction of .u-abiidopsis cDNA libraries 

, Total RHA from seedling, . inflorescence,' and silique 
tissues of Arabidopsis thaliana is isolated for use i^ 
construction of complementary (cDNA, libraries. The procedure 
-an adaptation of the DKA isolation protocol of Webb and 



35 



18^ -a Knapp, (1990) Plant Molec. Reporter 

Lsh I' *°"°"i°3 '«««iPtion assumes the use of Ig 

fresh weight of tissue. Frozen seed tissue is powdered by 

BBC buffer (SOmM Tris-HCl. pH 9, 0.8M NaOl, 10n« EDTA, 0 5% 

insoluble Polyvinylpolypyrrelidone,, and ground at room 
temperature. The homogenate.ls centrifuged for 5 minutes at 
12,000 xff to pellet insoluble material,. . The resulting 



BMSDCXaO: <WO_00B0575A^|_s 



wo 00/50575 



PCT/USOO/05311 



supernatant fraction is extracted with chloroform, and the top 
phase is recovered. 

The RNA is then precipitated by addition of 1 volxime RecP 
(BOmM Tris-HCL pH9, lOmM EDTA and 0,5% (w/v) CTAB) and 
5 collected by brief centrif ugation as before. The RNA pellet 
is redissolved in 0 . 4 ml of IM NaCl, The RNA pellet is 
redissolved in water and extracted with phenol /chloroform. 
Sufficient 3M potassium acetate (pH 5) is added to make the 
mixture 0.3M in acetate, followed by addition of two voliames 
10 of ethanol to precipitate the RNA. After washing with 

ethanol, this final RNA precipitate is dissolved in water and 
stored frozen. 

Alternatively, total RNAs may be obtained using TRIzol 
reagent (BRL Life Technologies, Gaithersburg, MD) following 
15 the manufacturers protocol. 

Complementary DNAs (cDNA) are obtained from the RNAs 
using the Marathon cDNA Amplification Kit (Clontech, Palo 
Alto, CA) following the manufacturers directions. 

20 Example 3 Cloning of HPO Lyase Sequences 

In order to characterize the protein encoded by the 
Arabidopsis cDNA GenBank sequence, the entire coding region 
corresponding to the Arabidopsis HPO lyase- like cDNA was 
obtained. (Figure 1) Synthetic oligo-nucleotide primers are 

25 designed to amplify the 5' and 3' ends from the HPO lyase-like 
sequence from RNA obtained in Example 2 . Primers are designed 
according to the Arabidopsis HPO lyase-like sequence and are 
used in Rapid Amplification of cDNA Ends (RACE) reactions 
(Frohman et al . (1988) Proc. Natl, Acad. Sci. USA 85:8998- 

30 9002) . Amplification of flanking sequences from cDNA clones 
are performed using the Marathon cDNA Amplification kit 
(Clontech) according to the manufacturers protocol, 

A pair of primers were designed to amplify the 5' and 3' 
regions from the Arabidopsis HPO lyase-like cDNA from the 

35 libraries described in example 2 cQ>ove. These two primers, 

HPOL28 (for 3' RACE, 5'-C6GTTCCTCTGC6CCTCTCTCGCCGGCG-3' ) and 
HPOL21 (for 5' RACE, 5 ' rrGCGGAACCGGAGGACTAAAACGCAGC-3 ' ) are 
used in PCR reactions with Adapter specific, primers (API' 5'- 
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CCATCCTAATACGACTCACTATAGGGC-3', provided in the Marathon cDNA 
Amplxfxcation Kit. For amplification of the 5' region of the 
HPO lyase-like cDNA the primers API and HPOL 21 were used and 
for the amplification of the 3' region the API primer was 'used 
in a reaction with the primer HPOL28. The cycle conditions 
used are: SA^'C for 1 minute followed by 5 rounds of 94«>C for 5 
seconds, 72'>C for 4 minutes, followed by 5 rounds of 
amplification using 94»C for 5 seconds, VQOc for 4 minutes, and 
finally 25 cycles of 940C for 5 seconds, and 68°C for 4 
10 minutes. 

A single fragment of llOO bp was obtained from the 3 'RACE 
reaction with RNA obtained from the silique tissue described 
above. TO confirm that the PCR product contained sequence 
corresponding to the HPO lyase-like sequence, a second round 
of PCR reactions using the same conditions described above was 
performed with the gel purified 1100 bp fragment. A reaction 
was performed with the primers HP0L13 (5'- 
CTTGGCGTAGTTCCTCAGCCTCTTG-3 ' ) and AP2 (5'- 
: ACTCACTATAGGGCTCGAGCGGC-3', to amplify an approximately 1000 
bp fragment as a confirmation of the HPO lyase-like sequence 
The reamplified 1000 bp fragment was gel purified and cloned" 
xnto the PCR2.1 topO vector (Invitrogen, Carlsbad, CA) to 
create the plasmid pCGN8094. 

The 5' RACE reaction produced many non-specific 
25 fragments. A 1000 bp fragment was excised from the gel and 
Cloned into the pCR2.1 TOPO(lnvitrogen) cloning vector to 
create the plasmid pCGN8091. 

Figure 1 discloses the complete nucleotide sequence of 
the Arabidopsls HPO lyase. 

30 

Example 4 Preparation HPO Lyase Expression Constructs 

A set of constructs are prepared for transformation into 
either plant or bacterial hosts to further characterize the 
Arai^idopsis HPO lyase-like sequence. The 5' RACE product in 
35 PCGN8091 was PCR amplified using the primers Alex2 (5'- 
CGGGATCCATGTTGTTGAGAACGATGGCGGCG-3 ' ) and Alex4 (5'' 
CAATC'rcCGGCGlTCTCGTCG-3'). The Alex2 primer contains the 
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restriction endonuclease site BaihHl for the convenient cloning 
of the PGR product into the pQE3 0 expression vector (Qiagen, 
Hilden, Germany) in frame with the ATG start codon of the 
vector. In addition to the oligonucleotide primers (0.2|iM 
5 each), the PGR reaction mix contained 0^2 mM each of dATP, 

dCTP, dGTP and dTTP, 1.0% glycerol, 0.2 itiM Tris-HCl (pH 8.3), 
4.6 mM KGl, 1 . 5 mM EDTA, 15 yM dithiothreitol , 7.3 lagm/ml BSA, 
1.1 mM KOAc and 0.1 units Pfu DNA polymerase (BRL Life 
Technologies, Gaithersburg, MD) . The mixtures were amplified 

10 using the following conditions: 1 cycle of 95^G for 10 

minutes; 30 cycles of 94**G for 20 seconds, 60**G for 30 seconds, 
and 72^G for 1.5 minutes; and, 1 cycle of 72°C for 7 minutes in 
a Perkin-Elmer 9800 thermocycler . The resulting PGR product 
was digested with BamHI and Hindi 1 1 and ligated into the 

15 vector PQE30 to create the vector pGGN8099. The 3' terminus 
of the Arabidopsis HPO lyase was cloned into the Hindlll site 
of pGGN8099 from pGGN8094 to create the E. coll expression 
vector pGGNSlOO 

A binary vector for plant transformation, pCGN513 8, was 

20 constructed from pGGN1558 (McBride amd Sximmerfelt, (1990) 
Plant Molecular Biology, 14:269-276). The polylinker of 
PCGN1558 was replaced as an Hindlll/jEcoRI fragment with a 
polylinker containing unique restriction endonuclease sites, 
Hindlll, SseX/PstX, No tl, BairiHI, Swal, Xbal, Pad, AscX, and 

25 Asp718. 

An antisense construct of the AraJbidop^ijsr HPO lyase- like 
nucleotide sequence was prepared for transformation of 
AraJbidqpsis. The nucleic acid sequence encoding the 5' 1000 
bp nucleotides from pGGN8091 were cloned as an EcdRl fragment 

30 into the plasmid pBluescript II SK (Stratagene, La Jolla, GA) 
to create the vector pGGN8093. The 3 ' RAGE product from 
pG(a^8090 was cloned as a Hindi 1 1 fragment into pGGN8093 to 
create a full length HPO lyase coding sequence in the plasmid 
PGGN8094. The KpnX site of pGGN8094 was removed by digesting 

35 with JCpnl and filling in the site with Klenow fragment, and 

the HPO lyase coding sequence was cloned from this plasmid as 
a S!znal fragment into the StuI site of pGGN8059 . This yields 
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the plasn^ld pCGNSlOl. The plasxnid pCGN8059 contains a multiple 
clonxng site downstream of the 35S promoter and the hsp70 
leader sequence to allow for the cloning of sequences for 
expression from the 35S promoter sequence. This vector also 
contaxns the nopaline synthase transcription termination (nos 
3 ) sequences (Fraley et al., p^oc. Natl. Acad. Sci (1983) 
80:4803-4807 and Depicker et al . , ^. ^oiec. App. . oenet 
(1982) 1: 562-573). The fragment containing the 35S 
promoter/hsp70 leader, antisense Arabidopsis HPO lyase 
sequence, and nos3 ' termination sequence was cloned from 
PCGNSlOl as a ..otl fragment into the same site of pCGN5138 to 
create the antisense expression construct pCGN8102. 

Example 5 E coli Expression 

The expression vector pCGNSlOO was transformed into E 
cola (strain M15, Qiagen, Hilden, Germany) using a calcium 
chlorxde procedure described in Maniatis, et al. ((1989) 
Molecular Cloning: A Lal^ratory Manual. Second Edition, Cold 
Sprxng Harbor Laboratory, Cold Spring Harbor, New York) 
Transformed colonies were screened by Western immunoblot 
analysis for expression of the HPO Lyase protein using 
antxbodies raised to the bell pepper HPO lyase as described in 
Shxbata, et ai . (1995) Plant Cell Pi^ysiol. 97:1059-1072 
^ Hydroperoxide lyase activity was determined by gas 
chromotography (GC) methods described by Matsui et aJ 
(1991), Phytochemlstry, 30:2109-2113 using linolenic acid 13- 
hydroperoxide as a substrate. 



TABLE 1 



30 



Sample 
8100 

Control 



Area 

24677 

4089 



nmole 
"130" 
28 



nmole/lOmin/mg 
24 



-lyase enzyme. l->^e sequence encodes a 13-HPO 

35 Exanpie 5 Identification of Additional HPO lyase Sequences 
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Additional HPO lyase-like sequences are obtained from 
tomato (Lycopersicon esculentvm Mill. Cv UC82B) green immature 
fruit (ca 20 mm) and cucimber (Cucumis sativus L. cv Suyo) 
hypocotyl (of 3 day old seedlings) tissues. Total RNA was 
5 isolated using TRIzol reagent (Gibco-BRL Life Technologies, 
Gaithersburg, MD) following the manufacturers protocol. 

Coitqplementary DNAs (cDNA) are obtained using the Marathon 
cDNA Amplification Kit (Clontech, Palo Alto, CA) following the 
manufacturers directions. 
10 The sequences of HPO lyase from bell pepper (Matsui, et 

al. (1996) supra) i banana (European Patent Application, 
Publication Number EP 0 801 133 A2) and Arahidopsis were 
aligned using ClustalW (http: //www, clustalw. genome. ac. jp/) and 



seven conserved peptide 


sequences were 


identified (see Figure 


7 


for positions, Tcd>le 2 


for a listing) 


• 






Table 2 






Peptide Sequence 


Primer Name 


Oligonucleotide Sequence 


I 


PGSYG 


HPOLIS 


5 ' -ATNCCNGGNWSNTAYGG-3 ' 


2 


QPLEEI 


HPOL2S 


5/ -CARCCNYTNGARGARAT-3 ' 






HPOL2AS 


5 ^ - ATYTCYTCiajAiaWGGYtG- 3 ' 


3 


GFNAYGG 


HP0L3S 


5'- GGNTTYAAYGCNTWYG<a^GG-3 ' 






HP0L3AS 


5 ' -CCaiCCRSANGCRTTRAANCC-3 ' 


4 


: YQPLVM 


HPOL4S 


5 ' -TAYCARCCNYTNGTNATG-3 ' 






HPOL4AS 


5 ' -CATNACNARNGGYTGRTA-3 ' 


5 


VFDEPE 


HPOL5S 


5 ' -GTNTTYGAYGANCCNGA-3 ' 






HPOLSAS 


5 ' -TCN(3GNTCRTCRAANAC-3 ' 


6 


NGPQTG 


HP0L6AS 


5 ' -CCNGTYTWNGGNCCRTT-3 ' 


7 


NKQCAAKD 


HPOIi7AS 


5 ' -CYTTNGCNGCRCAYTGYTTRTT-3 ' 



A set of synthetic oligonucleotides (Table 2) are 
synthesized for use in polymerase chain reactions with the 

20 cDNAs obtained above to identify sequences which are 

homologous to HPO lyase sequences . The PGR reactions are 
carried out using Advantage cDNA Polymerase Mix (Clonetech, 
Palo Alto,CA) using the reaction conditions according to the 
manufacturers protocol. The letter "iS" in the oligonucleotide 

25 name designates a PGR primer designed to amplify the sense 
strand, or forward reaction primer. The letters "AS" 
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designates a PGR primer designed to amplify the antisense 
strand, or reverse reaction primer. m the oligonucleotide 
sequence, the letters "N" represents an A, c, G, or a T the 
5 represents a C or a G in that position, the letter 

5 Y represents a C or a T, and the letter -R- represents an A 
or a G in that position. 

A single PGR product, of approximately 475 bp was 
amplifed in reactions containing the primers 4HPOL3S and 
11HPOL7AS, with the cDNAs obtained from both cucumber and 
tomato (described above) . The 475 bp PGR product from tomato 
and cucumber were cloned into the plasmid pGR2 . ITOPO 
(xnvxtrogen) to yield the plasmids T15 (pGGN8305) and G15 
(PCGN8309) respectively, m PGR reactions with 6HPOL4S and 
11HPOL7AS, a single product, of approximately 200 bp, was 
Obtained from amplification reactions with cDNA obtained from 
cucumber hypocotyl tissue. The 200 bp product was cloned into 
PCR2.1 TOPO (invitrogen) , to create the plasmid C17 
(PCGN8308) . 

The nucleotide sequence of each PGR product was 
determined by automated sequencing. The sequences obtained 
are compared to nucleic acid and amino acid sequences of HPO 
lyase sequences from bell pepper, ArabiCopsis, and banana 
leaf, as well as to DNA and amino acid sequences coding for 
T^lnVT" synthases from guayule ((1995) CT. Biol. Chem. 
270(15) :8487-8494), flaxseed ( (1993 ) Proc i^atl ilcad ^ci USA 
90(18) :8519-8523) and Arabidopsis. 

The results demonstrate that the T15 nucleic acid 
sequence is approximately 85% similar to the bell pepper HPO 
lyase DNA sequence and about 88% similar in the amino acid 
sequence Furthermore, the T15 sequence is also at least about 
55% sxmxlar to other HPO lyase nucleic acid sequences and at 
least about 57% similar in the amino acid sequence. m 
addxtxon, the T15 amino acid sequence is only about 41% 
sxmilar to the allene oxide synthase sequences. The C17 
sequence also follows a similar pattern of similarity to the 
HPO lyase sequences. Thus, the T15 and C17 sequences encode 
proteins highly similar to HPO lyase. 
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However, the results of the sequence comparisons (Figure 
8) demonstrate that the C15 nucleic acid sequence is between 
50% and 54% similar to the other HPO lyase nucleic acid 
sequences and about 58% similar to the allene oxide synthase 
5 DNA sequences. Furthermore, the deduced amino acid sequence 
of C15 is between about 38% and 42% similar to the HPO lyase 
amino acid sequences and about 51% similar to the AOS amino 
acid sequences. Thus, the C15 sequence encodes a protein 
which is divergent from both the known HPO lyase sequences, 

10 and is more similar to allene oxide synthase sequences. 

The T15, C15 and C17 sequences are used to search 
Genbank. Search results further confirm the sequences from 
T15 and C17 as being similar to HPO lyase sequences, while the 
sequence of C15 is more similar to allene oxide synthase 

15 sequences . 

In order to obtain full length coding sequence for T15, 
C15 2ind C17, RACE PCR reactions are employed using the 
Marathon cDNA Amplification kit (Clontech) according to the 
manufacturers protocol, and the oligonucleotides shown in 

20 Table 3. 

: Table 3 

lKMClO-1 : 5 ' -CGGTGGAGATCCTCGCCACCGGTGCCGACCC-3 ' 
2KMC10-2 : 5 ' -CTTCCTTCACGGTTGTCCTCACTTCCTCCGCCAG-3 ' 
3KMC17-1 : 5 ' -TCCAGCAGCGCTGCCCCTTTCTCTCCCCGG-3 ' 
4KMC17-2: 5 ' -CACTGTTTGTTCTTCTCGCTCGGTGTCCCCG-3 ' 
5KMC10-3 : 5 ' -GGGTCGGCACCGGTGGCGAGGATCTCCACCG-3 ' 
6KMC 10-4: 5 ' - CTGGCGGAGGAAGTGAGGACAACCGTGAAGGAAG- 3 ' 
7KMC17 -3 : 5 ' -CCGGGGAGAGAAAGGGGCAGCGCTGCTGG-3 ' 
8KMC 17-4: 5 ' - CGGGGACACCGAGCGAGAAGAACAAACAGTG- 3 ' 
9KMT15-1: 5 ' -GACTTGGTACTGGTGGACTAAGCCTAAGTGTTTC-S ' 
10KMT15-2 : 5 ' -GGCTGATAACCACAAAGAAGCTCCCCTTTC-3 ' 
11KMT15-3: 5 ' -GAAACACTTAGGCTTAGTCCACCAGTACCAAGTC-3 ' 
12KMT15 -4 : 5 ' -GAAAGGGGAGCTTCTTTGTGGTTATCAGCC-3 ' 
35 ' \ [ ~ 

PCR products from the amplification reactions with DNA 
obtained from tomato and cucumber , are cloned into pCR2.1 TOPO. 
The sequences of the 5 ' and 3 ' -RACE products from tomato 
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(PCGN8303 (5' RACE) and pCGN8304 (3' RACE)), cucumber C15 
(PCGN8302 (5 'RACE) and pCGN8306 (3' RACE)) and CI 7 (pCGN8301 
(5' RACE) and pCGN83b7 (3' RACE)) are sequenced and aligned 
with the respective sequences obtained from pCGN8305 
PCGN8309, and pCGN8308 to obtain preliminary full length 
sequences corresponding to a tomato HPO lyase-like sequence 
(Fxgure 4), a cucumber HPO lyase-like sequence (Figure 6) and 
a cucumber allene oxide synthase-like sequence (Figure 5) . 

Example 7 Preparation of Expression Constructs 

A set of constructs are prepared for transformation into 
exther plant or bacterial hosts to further characterize the 
novel sequence from cucumber. To create a full-length coding 
sequence for the cucumber (C15) allene oxide synthase-like 
sequence, the sequences from the 5' RACE (pCGN8302) and 3' 
RACE (PCGN8306) were PCR amplified and combined at a unique 
restriction endonuclease site. 

The 5' C15 sequence is air5>lified using primers (4KMC15ES1 
5'- CGGQATCCATGGCTTCTTCCTCCCCTGAACTTC-3 ' and 5KMC15EAS2 5'- 
TGCCGACCCATTTCAGTATAGTGGG-3') in PCR an^lif ication reactions 
described above. The primer 4KMC15EAS1 amplifies from the 5' 
region and contains the start codon (ATG) , and a BaMJI site 
The 3' C15 sequencers amplified using the API primer provided 
xn the Marathon Kit (BRL-Lif etechnologies, Gaithersburg, MD) 
and the primer 6KMC15ES3 (5'- TTCACACCATTCCCGTGCCTTTCTTGCC- 
3^) . The sequence of the CIS full length clone is shown in 
Figure 6. 

A. Bacterial Expression Construct 

30 . ux''''^"' "^""^ -Plification product is digested with 

30 BaMil and .Cbal (unique site endogenous to the CIS sequence) 

and cloned into the expression vector pQE30 (Invitrogen) with 
the amplification product of the 3' RACE PGR reaction digested 
with Xbal and Smal. This construct provides a full length 

encoding sequence of the CIS cDNA in the f ^r^i ^ 
^f. '-i-'iNA in cne s. coli expression 

35 vector to create the vector PCGN8333. The full length 

sequence is also cloned into the plasmid pUC119 to create the 
vector PCGN8334. 

B. Plant Ejcpression Construct : • 



20 



25 
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A binary vector for plant transformation, pCGN5138, was 
constructed from pCGN1558 (McBride and Summer felt, (1990) 
Plant Molecular Biology, 14:269-276). The polylinker of 
pCGN1558 was replaced as an Hindlll/ScoRI fragment with a 
5 polylinker containing unique restriction endonuclease sites, 
Hindlll, SseX/ Ps tl, NotX, BanMl, Swal, Xbal, Pad, AscI, and 
Aspl 18. 

The full length coding sequence of C15 is cloned to be 
expressed from the plant constitutive promoter 35S for 
10 expression in plants. The expression cassette is cloned into 
the binary vector pCGN5138 to create the vector pCGN8337. 

Example 8 Expression of cucumber C15 in E. coli 

The expression vector pCGN8333 was transformed into E 

15 coll {strain M15, Qiagen, Hilden, Germany) using a calcium 
chloride procedure described in Maniatis, et al. ((1989) 
Molecular Cloning: A Laboratory Manual, Second Edition, Cold 
Spring Harbor LsJ^oratory, Cold Spring Harbor, New York) . 
Transformed colonies were screened by Western immiinoblot 

20 analysis for expression of the HPO Lyase protein using 

antibodies raised to the bell pepper HPO lyase as described in 
Shibata, et al. (1995) Plant Cell Physiol. 97:1059-1072. 

Hydroperoxide lyase activity was determined by 
spec tropho tome trie and gas chromotography (GC) methods 

25 described by Matsui, et al. (1991), Phytochemistry, 30:2109- 

2113, using both linolenic acid 13 -hydroperoxide and linolenic 
acid 9 -hydroperoxide as substrates. 

The results of the gas chromotography assay (Figure 9) 
demonstrate that the protein encoded by the cucumber C15 

30 sequence has greater activity toward linolenic acid 9- 

hydroperoxide (Figure 9B) substrates than linolenic acid 13- 
hydroperoxide substrates (Figure 9A) . The results of the 
spec tropho tome trie assays further demonstrate the preference 
of the protein encoded by cucumber HPO lyase nucleic acid 

35 sequence for 9 -Hydroperoxide substrates. The results of the 
spec tropho tome trie assay are presented in Figure 10. 
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Thus, the cucumber CIS sequence represents the first 
known cloning of a nucleic acid sequence encoding a 9- 
hydroper oxide lyase. 

All publications and patent applications mentioned in 
this specification are indicative of the level of skill of 
those skilled in the art to which this invention pertains 
All publications and patent applications are herein 
incorporated by reference to the same extent as if each 
individual publication or patent application was specifically 
and individually indicated to be incorporated by reference 

Although the foregoing invention has been described in 
some detail by way of illustration and example for purposes of 
clarity of understanding, it will be obvious that certain 
Changes and modifications may be practiced within the scope of 
the appended claim. 



BNSOCXSICk •iWO_0060678A«J_> 



wo 00/50575 



41 



PCT/USOO/05311 



Claims 

What is Claimed is: 

I . An isolated nucleic acid sequence coding a 
hydroperoxide lyase with activity toward fatty acid 9- 

5 hydroperoxides . 

2- The sequence according to Claim 1, wherein said 
sequence has activity toward linolenic acid 9 -hydroperoxide. 

3. The sequence according to Claim 1, wherein said 
sequence is obtained from a plant source. 

10 4. The sequence according to Claim 1, wherein said 

sequence is obtained from cucumber. 

5. The sequence according to Claim 1, wherein said 
sequence is obtained from cuciamber hypocotyl. 

6. The sequence according to Claim 1, comprising the 
15 sequence shown in Figure 6. 

7. A construct comprising a promoter fxinctional in a host 
cell, a sequence encoding a 9 -hydroperoxide lyase, and a 
trsmscriptional termination sequence. 

8. A construct according to Claim 7 wherein said 9- 
20 . hydroperoxide lyase sequence is isolated from a plant. 

9. A construct according to Claim 7 wherein said 9- 
hydroperbxide lyase sequence is isolated from cucumber. 

10. A method for increasing the resistance of a plant to 
a plant pathogen comprising expressing an HPO lyase from a 

25 construct according to Claim 1, wherein said HPO lysase 
encoding sequence is linked to DNA sequence capable of 
directing expression in a plant cell, 

II. A method for increasing the volatile composition of a 
plcoit comprising expressing an HPO lyase from a construct 

30 according to Claim 7 , wherein said HPO lysase encoding 
sequence is linked to DNA sequence capable of directing 
expression in a plant cell. 



BNSCCCID: 'cWQ__0050S75A?_l_> 



wo 00/50575 J I 



PCT/USOO/05311 



u 
< 

o 
c 

Jl 

o 
JJ 

C ■ 



o 

U 
O 

c 
o 



o 







XI 








U 




c 




3 




U 
















i 








O 




•v.. 




0) 




0) 




<d 




>f 




















CO 




8- 


in 




00 




*n 






i:) in 




<d oo 


1 












c 




o 








XJ 


(» 




M 


CO 




o 





(D (U <D 

£ u u 
fltf c c 
coo 

O 3 D 

—I cr 0* 
^ o o 

Cu CO u 




BNSCX3CID: <WO_0060575A?_I_> 



wo 00/50575 



2 / 13 



PCT/USOO/05311 




o O 
o < 

<N g W 

S O 

l> 

o < 
m H >i 

u 

< . 
go 
u 

o O ft* 

CD 

-4 ^ « 

o O 

rH O W 

o w 
u 

^ 

^ O Q 

> 

o 5 2 
in 

O < 

< a: 

< s 

o 

o U 
m L) Ou 

^ g 

o g O 
^ K o 
u ^ 

o O 

iH 



o < H 

rH O > 



rt: »-• 

< 

O Q 
o O 

f>a < « 

U 

r 
=^ 

S S u 

u o 



U 

o z 

O) 

U 

O < 

u 
o 

o o 

vr> U 

<N U Oi 

u 
u 

^ ^ ^ 

tn ^ H 

iH *^ 
O O 

o 

o o O 

^ o 

^ a 2 



BNSOOCID: <WO_0060576Aa.L> 



wo 00/50575 



3 / 13 



PCT/USOO/05311 



C GENETVX : Amino Acid 

*o Acxa Sequc. .e Homology Data ] 

1st Amino Acid Sequ^e 

IS^Size : Jff i^-i- lyase 

2nd Amino Acid Sequence 

Fxle Name . n^n r^«r*^*. -i 

Sequence size i JIq ^^^^^ ^ 

CZnit Size to compare = 2 



157.0% / 470 aa] 



10.- ■---~™-«^«v™s^,,,i,^i-^^ 

RVCGSGSDUJFKTVNEMPTW 

41. ■ ?=-°?VCE,?™LV-«^ 

479' TGDSGSIKAWKAK 



479' TGDSGSIKAWKAK 
. .*** » 

467- SFSSGSLTSVKKAC 
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( CTMETYX : Amino Acid Sequence Honiology Data ] 

1st: Amino Acid Sequence 

File Name : Arabidppsis lyase AA 

Sequence Size : 492 

2nd Amino Acid Sequence 

File Name : Arabidppsis ADS AA 

Sequence Size : 517 

Unit Size to conpare = 2 



[39-2% / 485 aal 
1' 



MLLFTMAATSPRPPPSTSLTSQQPPSPPSQLPL!^^ 
1- M^^ISTPFPlSriO'KT^/PSKPt^^ 
40 • ^LVGPLSra^ 

100 • VKSFSHLFDMDLVDprvXIC;^ 

ISO' SKWWl^EIi?SNUnT7CriESE^ 

179- HNI^?E^QA'^k£ii^££WpLld:^a^^ 

220- NGWRTOTNLAWVI^TAKt^^ 

238- DAPGLITKWvdli£jiPLLSTC 

280' tJ*I^EEFGI.TRrffiAIq^J^Ja^rtXSFl^ 

296- L-VEADKI/SISREEATHSir^ATSrOT^^ 

340' C-GSGSDU^FKTVNEMELVKSVVVErL^ 

3 55 - IKSNGGELTMSAIEKMELTKSVVYBCr^^ 

399* LCOTQPLVMRDftNVFDEPEEF^ 

• *• **...** ***** *.** ****.r *♦ 

415 ■ LYGVQPEiATIUDPKIFDPiUaEFVPERFVGEEXSEKIjI 
459' rVTLTASLLVADLFLRVTOITGDSGSIK^^ 
475 • FVVIA/aRLF^marRKm SFJJ 

Ff-g.3 
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tOENETYX: Translation of Nucleotide into Anuno Acids] 
1998.06.24 

Filename , kmtlS-l/full copy 

Sequence Size : 1901 
Sequence Position: 2 - 1901 



10 20 30 40 

420 430 ^ T K R M B K 

530 540 ^ ^ ^ ^ V I, o V K S P s 

810 • 820 830 * ^ Q J' I. F M F p s I,, 

920 ,30 ° ^ " I- A X Q I. A P T V 

1210 1220 1230 1240 R K E V R 
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1410 1420 1430 1440 1450 1460 1470 1480 14'90 1500 

agcctttagttatgaaagatccaaaggtgtttgatgaacctgaaaagtttgtgttggagagatttacaaaggaaaaagggaaagaattgctgaattattt 
PLVMKDPKVFDEPEKFVLERFTKEKGKELLNYL 

1510 1520 1530 1540 1550 1560 1570 1580 1590 1600 

gttttggtctaatggtccacaaactgggagacctactgaatcaaacaagcaatgtgctgctaaagatiatggttactctaactgcttctttgattgtggct 
FWSNGPQTGRPTESNKQCAAKD M V T I. T A S.I. I V A 

1610 1620 1630 1640 1650 1660 1670 1680 1690 1700 

tacattttccaaaagtacgattctgtgagtttctcatctggttcactcacatctgtgaaaaaggctagttgagttctgggctgtgtagctgcaaaatcta 
Y IFQKY DSVSFS SGSLTSVKKA S* 

1710 1720 1730 1740 1750 1760 1770 1780 1790 1800 

agcttgctagacatgttgatcttaagtctcggtggctggcttttccttcctccccctattgtgtaatatMWitttcacaaataaaagttcaacatgagtgt 

1810 1820 1830 1840 1850 1860 1870 1880 1890 1900 

acttattttctgttctgacttatgactattgtctatcctcttacaaattaaqttgqaacataaatctaaqttataattacaAaaaaaq^aaaaaaaaaaa 
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I<^=«HrvX: Translation of »«cleotid. into toino Ada,, 
1998.06.24 

Filename : ci7/full copy 

Sequence Size : 1712 
Sequence Position: 1 - 1712 



310 3J0 '*»'*«"««~»=«=MttteoTO.tCt„t 

^^^^^^^^^^ 

410 420 ''^■'«'*"'9»e«tggaaatcgtgg«gaaaa 

610 S20 S30 ='*"»«=«»tgtg9gaesgcattBa&ttagaa 

at„c..a,aac...«atc,,,tttc.aa.aetttcSLacc.^- SSO ,„o 

710 730 ""^V^-^ctctcgccwcgccgacactgcaaaatcac 

c«aagt„caa.atccggatacatagacgtcataat:;:,ctcggc:?Lg^^^^ „o 30O 

810 820 830 "'***'*'=*'^'=*«"*"««tctcc.gccccttgaa»a«it 

actccttcactctttccgcttaccctttttccccattjcltcccgc^^^^^ 8S0 ,„o 

910 920 „„ "*^"'=""=*^9»»«K»9caga.gttgttgagega 

940 OCA 

"10 U3, ■:'~">"w=..-«|»,.«.et«c^t^ 

U20 1330 "*"'*"'»~"»=W'0>Wt«..t«.OCOTTOC 

"°"~""~""='™"««--«™=^««,;t"„„„^^^^^^^ 13.0 1... 

l«o 1.3, "--.."^cc ^m^^,^.^ 

"c™„_,^^^^0^^1.3J^,^^3«. ^^^^ 
ISIO 1520 1S30 **™'=»*"'^«=9Ct9gggaaggaggctccac 

aeaaaaaaaoaa j^, 

rig. 5 
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[GENETYX: Trails latlon of Kucleowxde inco Amino Acids) 
1998.06.24 

Filename : CIS full/cDKA copy 

Sequence Size : 1813 
Sequence Position: 2 - 1813 



10 20 30 40 50 60 70 80 90 100 

ccactctcttccaacgtgaagataagaaaccttttgtttactttttgtacgatcacaggtcacagcaatggcttcttcctcccctgaacttcetctcaM 

MASS SPELPLK 

110 120 130 140 150 160 170 IBO 190 200 

cccattcccggtggctatggcttccccttcctcggtcccatcaaagaccgttacgattacttctatttccaaggtagagacgaattcttccgttcccgaa 
PIPGGYGFPPI^GPIKDRYDYFYFQGRDEF FRSRI 

210 220 230 240 250 260 270 280 290 300 

ttaccaaatacaactccaccgtcttccacgccaacatgccaccgggccccttcatctcctccgattccagagtcgttgtcctcctcgatgccctcagttt 
TKYN STVFHANMP PGPFI S SDSRVVVLLDALSF 

310 320 330 340 350 360 370 380 390 400 

tcccatcctcttcgacaccaccaaagtcgagaaacgcaacattctcgacggaacttacatgccctccttgtccttcaccggcggcattcgcacctgtgct: 
PII.FDTTKVEKRNI1.D6TYMPSLSFTGGIRTCA 

410 420 430 440 450 460 470 480 490 500 

tactitggacccaticggaaacagagcacactgttctcaaacgcctctttctctcctttctegctteecaccatgacaggttcatccctctgettcgaagcc 
YLDPSETB HTVLKR1.FLSPLASHHDRFIPI,PRSS 

510 520 530 540 550 560 570 ^80 590 600 

ccttgtctgagatgtttgttaagcttgaagataaactcgccgacziaaaataagatcgctgatttcaactcgattagtgatgccgtgtcgtttgattatgt 
L S E M F V K I# E D K X A D K N K l A D F N S I S D A V S F D Y V 

€10 620 630 640 650 660 670 680 690 700 

tttccgtttattctccgatggaacccctgattcgacattagctgctgatggacccggaatgttcgatttatggcttgggcttcaacctgccccattggct 
FRL F SD GTPDSTLAADGPGMFDI.WLGLQI.A PI,A 

710 720 730 740 750 760 770 780 790 800 

tccattggccttcccaaaattttctctgtttttgaagatctcattattcacaccattcccctgcctttcttcccagccaagagtcgttacaggaagcttt 
S I G L P K I F. S V F E D L I I H T I P L P F P P V K S R Y R K L Y 

810 820 830 840 850 860 870 880 890 900 

ataaagcgttttactcctcctctggctcatttctagacgaagcagagaaacaggggalMigacagagagaaagcatgticacaatttSigtgtttcttgctgg 
KAFYSSS GSFLO E AE KQGIDREKACHNLVFLAG 

910 920 930 940 950 960 970 980 990 1000 

attcaacgcatacgggggaatgaaagtcctttttcccactatactgaaatgggtcggcaccggtggcgaggacctccaccgtaaaccggcggaggaagtg 
F NAY GGMK VLF PT1LKWVGTGGBDLHRKI.AEEV 

1010 1020 1030 1040 1050 1060 1070 1080 1090 1100 

aggacaaccgtgaaggaagaagggggactgactttctccgccttggagaaaatgagtccgctgaagccggtcgcgtacgaagctctgaggatcgaaccgc 
RTTVKEEGGLTFSAtEKMSLLKSVVYEALRIEPP 

1110 1120 1130 1140 1150 1160 1170 1180 1190 1200 

cggtgccgttccagtacgggaaagcgaaggaggatatcgtgattcagagccacgattcttgtctcaagatcaaaaaaggggagacgatttttggttatca 
VPFQYGKAKEDIV IQSHDSCFKIKKGETIFGYQ 

1210 1220 1230 1240 1250 1260 1270 1280 1290 1300 

gccgttcgctaccaaagatccgaagatctttaaggactcggagaagttcgtgggcgataggttcgtgggagaagaaggggagaagcctttgaagtatgtt 
PFATKD PKIFKDSEKFVGDRFVGEEGEKLLKYV 

1310 1320 1330 1340 1350 1360 1370 1380 1390 1400 

cactggtcaaatgagcgggagacggtggagccgacggcggagaacaagcagtgcccggggaagaatctggtggcgatgatgggcaggattaccgtggtgg 
YWSNERETVEPTAENKQCPGKNLVVMMGRIIVVE 
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Sequence type explicitly set to Protein 
Sequence fonnat is Peaison 
Sequence 1: gi I 1272340 480 aa 

Sequence 2 : banana 483 a a 

Sequence 3 : ARABIDOPSIS 492 
Start of Pairwise alignments 
Alicihing. . . 

Sequences (1:2) Aligned. Score: 
Sequences (1:3) Aligned. Score: 
Sequences (2:3) Aligned. Score: 
Start of Multiple Alignment 
There are 2 groups 
Aligning . . . 

Group 1: Sequences: 2 Score: 4874 

(3 roup 2: Seqviences : 3 Score: 4967 

Alignment Score 5148 

CLUSTAL-Alignroent file created [ clustalw. aln) 
CLUSTAL W (1.7) multiple sequence alignment 



aa 



51 
55 
53 



gill272340 

banana 

ARABIDOPSIS 



gill272340 

banana 

ARABIDOPSIS 



gill272340 

banana 

ARABIDOPSIS 



gill272340 

banana 

ARABIDOPSIS 



MIPIMSSAPLSTATPISLPVRKIPGSYGFPLLGPLWDRLDYNWFQ 

MAMMWSSASATAVT- -TLPTRPIPGS YG PPLVG PLKDRLDYFWFQ 

MLLRTMAATSPRPPPSTSLTSQQPPSPPS—QLPLRTMEaSlfiWPLVGPLSDRLDYFWFQ 
:*.,:.: ** « . ***^* ***** *** 

KLPDFFSKR^^KYNSTVFRT^n^PP(rFPFFLGVNP^IWAVLDVKSPAHLFDMEIVEKANVL 
GPETFFRSRMATHKSTVTRTbmPPTFPFFVlSVDPRVVTVLDcrTSPSALFDLQVVEKKNIL 
GPDKFFRTRAEKYKSTVFRTNIPPTFPFFI3NVNPNIVAVLDVKSFSHLFDMDLVDKRDVL 



*•* *♦ . *«« , 



VGDFMPSWYTGDMRVCAYLDTSEPKHTQIKNFSLDILKRSSKTWVPTLVKELOTLFGTF 

IGDYMPSLSFTGDTRVWYLDPSEPDHARVKSFCLQLLRRGAKTWVSSFLSNLDVMLATI 

IGDFRPSLGFYGGVRVGVYLDTTEPKHAKIKGFAMETLKRSSKVWLQELRSNLNIFW3TI 
:**: **: : *. ** .***.:**.♦:::*.*.:: . . *. 

ESDLSKSKSASLLPALQKFLFNFFSLTFU3ADPSASPEIANSGFAYLDAWLAIOLAPTVS 

EQ<3IAKD<3SAGLFGPLQKCIFAFLCKSIIGADPSVSPDVGENGFVMLDKWLALQLLPTVK 

ESEISKNGAASYIFPLQRCIPSFLCASLAGVDASVSPDIAEN(3WKTINWLAl6vIPTAK 
*. :*. : **: :* »?. * * »*.. . ». .. *★*.♦". ** 



gill272340 

banana 

ARABIDOPSIS 



gill272340 

banana 

ARABIDOPSIS 



gill272340 

banana 

ARABIDOPSIS 



gill272340 

banana 

ARABIDOPSIS 



gill272340 

banana 

ARABIDOPSIS 



IG-'/LQPLEEIFVHSFSYPYFLVRGGYEKLIKFVK3EAKEVLTRAQTDFQLTE0EAIHNL 

VGAIPQPLEEILLHSFPLPFFLVSRI)YRKLYEFVEX0(30EVVRRAETEHGLSKHDAINNI 

LGV\^QH*EEXLLHTWPYPSLLIAGhr^KKLYNFIDENAGCxrLRLGOEEFGLTRDEAIQNL 
;* ; ******. * .* .*.** :*:..:. - . .* . .**.», 

^ • ••••• • a mm mm «••■ . , 

lpilgfnafggftiflptllgnlgdbknaemqeklrkevrekvgtnoen-lsfesvkemfe 
lpvlgfnafggfsvffptllttigrdk-tri'lreklkdevrrvmksrgekrpsfetvreme 
lfvlgfnaygqfsvflpsligritgdn-sglqerirtevrrvcgs^gsd-lnfktvneme 

**:***♦*:♦**::*:♦:* *... *** . *..* *** 

^— ^ ^HroU^ 

LVQSFVYESLRLS£>PVPSQYARARKDFMLSSHDSVYEIKKGELLCGYQPLVMKDPKVFDE 

LVRSTVYEVLRLNPPVPLQUGRARTDFTLNSHDAAFKVEKGELLcrGYQPLVMRDPAVFDD 

LVKSWYETLRFSPPVPLQFARARKDFQISSHDAVFEVICKGELUJG iQELYMRDANVEDE 
**.* *** **.**** * *»♦ ** .***... J .*♦*♦*♦*♦♦♦***.* ^ ***., 

PEKFMLERFTKEKGKELL^^yLFWSNGPQTGSPTES^mQ^:•AAKDAVTLT^L^V^ 

PETFAPERFMGS-GKELLKYVFWSNGPETGTPTPANKO<:AAKDYWETArLLMAEIFYRY 

ZEEFKPDRYVGETGSELLtnfLYWSHOEQiaTPSASmOjymiVTLTASLLVADLFLRY 
** * :*: . *.***:*::*«««*:**.*. . i^* * * » * * *^ ***..* .* .* 

DSVSFSSGSLTSVXKAC ^ // fJC^ ffJ^ J? 

DEFVCADDAISVTKLDRAREWE ^ 
DTITGDSGSIKAWKAK "X 



R9.7 
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SEQUENCE LISTING 

<110> Matsui, Kenji 

<120> Patty Acid 9 -hydroperoxide Lyase Nucleic 
Acid Sequences 

<130> 15457/00/WO 

<140> New Application 
<141> 2000-02-25 

<160> 44 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 1385 
<212> DNA 
<213> Arabidopsis 

<400> 1 

atggcggcga cttccccgcg gccaccaccg tcaacatccc taacatctca gcagccacca 60 

tcacccccct cacagcttcc cctccgtaca atgccgggat cgtacggctg gccgttggtt 120 

ggaccattat cggaccgttt agattccaag gacccgataa gtttttccgg acaagagctg 180 

agaagtataa gagcactgtg ttccgtacaa atattcctcc gacgtttcct ttcttcggca 240 

acccatcttt ttgacatgga tctagttgat aaaacfagatg ttctcatcgg agacttccgg 300 

cctagccttg ggttctacgg cggcgttcgt gttggagttt atctcgacac tactgagcca 360 

aagcacgcca aggttcgtac atacgttgct ttgctactat tacataaaag gtttcgctat 420 

ggaaacacta aaacgaagct caaaagtatg gctacaagag cttcgttcaa acctaaacat 480 

tttctgggga acaatcgaat ccgaaatctc caaaaacggt tcctctgcgc ctctctcgcc 540 

ggcgttgacg cttccgtatc gccggacatc gctgagaacg gttggaaaac aatcaatact 600 

tggcttgcgt tgcaagttat tcccactgct aaacttggcg tagttcctca gcctcttgaa 660 

gagattttac ttcatacttg gccttatcct tctctcttaa tcgccggaaa ttacaaaaag 720. 

ctttacaatt tcatcgacga gaacgccgga gattgtctcc ggttaggtca agaagaattc 780 

gggttgaccc gagatgaggc tattcaaaat cttctctttg ttttaggttt taatgcctac 840 

gggggctttt ccgtcttctt accttctttg atcgggagaa taaccggcga caattccggt 900 

ttacaggaga ggattagaac cgaagtcagg agagtttgcg gatccgggtc ggatcttaat 960 

ttcaagacgg ttaacgaaat ggagctggtt aaatccgtgg tttacgaaac gctgcgtttt 1020 

agtcctccgg ttccgctgca attcgcacgt gcgaggaaag attttcagat aagttcacac 1080 

gatgctgttt ttgaggtcaa gaaaggtgag cttctttgtg gttatcagcc gcttgtgatg 1140 

agagacgcta atgtttttga cgaaccggag gaatttaaac cggaccggta tgtaggtgag 1200 

accgggtctg aattgctgaa ttatctctac tggtctaacg gtccacaaac cggtaccccg 1260 

agcgcgtcga acaaacagtg tgcagctaag gacattgtca ctctcacggc ttccttgctc 1320 

gttgccgatt tatttctccg - atcaaagact tcaaaaactg aaacacaaca accatcatcg 1380 

tatgg 1385 

<210> 2 

<211> 462 

<212> PRT 

<213> Arcibidopsis 

<400> 2 

Met Ala Ala Thr Ser Pro Arg Pro Pro Pro Ser Thr Ser Leu Thr Ser 
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1 5 
Gin Gin Pro Pro Ser Pro Pro Ser Gin IL 

Gly Ser Gly Trp Pro Leu Val cfy Pro 

Ser Lys Asp Pro He Ser Phe ler Gly Gin 

Ala Leu Cys Ser Val Gin "e Phe Leu Arg 
" , 70 

Thr His Leu Phe Asp Met Asp Leu Val Asp 

Gly Asp Phe Arg Pro Ser Leu Gly Phe Tyr 

Val Tyr Leu Asp Thr Thr Glu Pro tyl His 

Val Ala Leu Leu Leu Leu His tyl Arg Phe 

^ Lys Leu Lys Ser Met HI Thr Arg Ala 

Phe Leu Gly Asn Asn A^g He Arg Asn Leu 

Ala ser Leu Ala Gly Val Asp Asp Ala Hi 

Glu Asn Gly Trp Lys Thr He Asn Jhl Trp 

200 

Val Val Pro Gin 

215 

Leu His Thr Trp Pro ^ Pro Ser Leu Leu 

Lys Leu Tyr Asn Phe He Asp Glu Asn Ala 

Gly Gin Glu Glu Ihl Gly Leu Thr Arg Ifp 

Leu Phe Val Leu Gly Phe Asn Ala ^ Gly 

280 

Pro ser Leu He Gly Arg He Thr Gly Asp 

Arg He Arg Thr Glu Val tej Arg Val Cys 
A 310 

Asn Phe Lys Thr Val Asn Glu Met Glu Leu 

Glu Thr Leu Arg HI ser Pro Pro Val Iro 

340 34c 
Arg Lys Asp Phe Gin He Ser Ser His Asp 

360 

Lys Gly Glu Leu Leu Cys Gly Tyr Gin Pro 

Asn Val Phe Asp Glu Pro III Glu Phe Lys 

° 390 
Glu Thr Gly Ser Glu Leu Leu Asn Tyr Leu 

Gin Thr Gly Thr Ser Ala Ser Asn lJs 

He val Thr Leu Thr Ala Ser Leu til Val 
"5 44Q 



15 



Pro Leu Arg Thr Met Pro 
30 

Leu Ser Asp Arg Leu Asp 
45 ^ 
Glu Leu Arg Ser lie Arg 
60 

Arg Phe Leu Ser Ser Ala 
7^ 80 
Lys Arg Asp Val Leu lie 

Gly Gly Val Arg Val Gly 
110 

Ala Lys Val Arg Thr Tyr 
125 

Arg Tyr Gly Asn Thr Lys 

140 ^ 
Ser Phe Lys Pro Lys His 

Gin Lys Arg Phe Leu Cys 
"5 

Val Ser Pro Asp He Ala 
190 

Leu Ala Leu Gin Val He 
205 

Pro Leu Glu Glu He Leu 
220 

He Ala Gly Asn Tyr Lys 
235 240 
Gly Asp Cys Leu Arg Leu 
255 

Glu Ala He Gin Asn Leu 
270 

Gly Phe Ser Val Phe Leu 
285 

Asn Ser Gly Leu Gin Glu 
300 

Gly Ser Gly Ser Asp Leu 

320 

Val Lys Ser Val Val Tyr 
, 335 
Leu: Gin Phe Ala Arg Ala 
350 

Ala; Val Phe Glu Val Lys 
365 

Val Met Arg Asp Ala 

?f ? Arg Tyr Val Gly 

m. « 400 
Tyr Trp Ser Asn Gly Pro 

415 

Gin Cys Ala Ala Lys Asp 
430 

Ala Asp Leu Phe Leu Arg 
445 



- 2 - 



BNSDOCtD: <WO_0060676A3J_> 



wo 00/50575 



PCTAJSOO/05311 



Ser Lys Thr Ser Lys Thr Glu Thr Gin Gin Pro Ser Ser Tyr 
450 455 460 

<210> 3 

<211> 492 

<212> PRT 

<213> ArsUbidopsis 

<400> 3 



Met 


Leu 


Leu Arg 


Thr 


Met 


Ala 


Ala 


Thr 


Ser 


Pro 


Arcr 


Pro 


Pro 


Pro 


Ser 


1 






5 










10 










15 




Thr 


Ser 


Leu Thr 
20 


Ser 


Gin 


Gin 


Pro 


Pro 
25 


Ser 


Pro 


Pro 


Ser 


Gin 
30 


Leu 


Pro 


Leu 


Arci 


Thr Met 


Pro 


Gly 


Ser 


Tyr 


Gly 


Tip 


Pro 


Leu 


Val 


Glv 


Pro 


Leu 






35 








40 










45 






Ser 


Asp 
50 


Arg Leu 


Asp 


Tyr 


Phe 
55 


Trp 


Phe 


Gin 


Gly 


Pro 
60 


Asp 


Lys 


Phe 


Phe 




Thr 


Arg Ala 


Glu 


Lys 


Tyr 


Lys 


Ser 


Thr 


Val 


Phe 


Arg 


Thr 


Asn 


He 


65 








70 










75 










80 


Pro 


Pro 


Thr Phe 


Pro 


Phe 


Phe 


Gly Asn Val 


Asn 


Pro 


Asn 


He 


Val 


Ala 








85 










90 










95 




Val 


Leu 


Asp Val 
100 


Lys 


Ser 


Phe 


Ser 


His 
105 


Leu 


Phe 


Asp 


Met 


Asp 
110 


Leu 


Val 


Aso 


Lvs 


Arg Asp Val 


Leu 


He 


Gly 


Asp 


Phe 


Arg 


Pro 


Ser 


Leu 


Gly 


Phe 






115 








120 










125 








Tvr 


Glv 


Gly Val Arg Val 


Gly Val 


Tyr 


Leu 


Asp 


Thr 


Thr 


Glu 


Pro 


Lys 




130 








135 










140 










His 


Ala 


Lys He 


Lys 


Gly 


Phe 


Ala 


Met 


Glu 


Thr 


Leu 


Lys 


Arg 


Ser 


Ser 


145 








150 










155 








160 


Lvs 


Val 


Trp Leu 


Gin 


Glu 


Leu Arg 


Ser 


Asn 


Leu 


Asn 


He 


Phe 


Tip 


Gly 








165 










170 










175 




Thr 


He 


Glu Ser 


Glu 


He 


Ser 


Lys 


Asn Gly 


Ala 


Ala 


Ser 


Tyr 


He 


Phe 






180 










185 










190 






Pro 


Leu 


Gin Arg Cys 


He 


Phe 


Ser 


Phe 


Leu 


Cys Ala 


Ser 


Leu 


Ala Gly 






195 








200 










205 








Val 


Asp 


Ala Ser 


Val 


Ser 


Pro Asp 


He 


Ala 


Glu 


Asn Gly 


Trp 


Lys 


Thr 




210 








215 










220 








He 


Asn 


Thr Trp 


Leu 


Ala 


Leu 


Gin 


Val 


He 


Pro 


Thr 


Ala 


Lys 


Leu 


Gly 


225 








230 










235 








240 


Val 


Val 


Pro Gin 


Pro 


Leu 


Glu 


Glu 


He 


Leu 


Leu 


His 


Thr 


Trp 


Pro 


Ty± 








245 










250 










255 


Pro 


Ser 


Leu Leu 


He Ala Gly Asn Tyr Lys 


Lys 


Leu 


Tyr 


Asn 


Phe 


He 






260 










265 








270 






Asp 


Glu 


Asn Ala 


Gly Asp. Cys 


Leu 


Arg 


Leu 


Gly Gin Glu 


Glu 


Phe Gly 






275 








280 










285 








Leu 


Thr 


Arg Asp Glu Ala 


He 


Gin 


Asn 


Leu 


Leu 


Phe 


Val 


Leu Gly 


Phe 




290 








295 










300 










Asn 


Ala 


Tyr Gly Gly 


Phe 


Ser 


Val 


Phe 


Leu 


Pro 


Ser 


Leu 


lie 


Gly Arg 


305 








310 










315 










320 


He 


Thr 


Gly Asp Asn 


Ser Gly 


Leu 


Gin 


Glu 


Arg 


He 


Arg 


Thr 


Glu 


Val 








325 










330 










335 




Arg Arg 


Val Cys 


Gly 


Ser 


Gly 


Ser 


Asp 


Leu 


Asn. Phe 


Lys 


Thr 


Val 


Asn 






340 










345 










350 






Glu 


Met: 


Glu Leu 


Val 


Lys 


Ser Val Val 


Tyr 


Glu Thr 


Leu 


Arg 


Phe 


Ser 






355 








360 










365 
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Pro Pro val Pro Leu Gin Phe Ala Arg Ala Ara Lys Asp Phe Gin 11. 

Ser ser His Asp Ala Val Phe Glu Val Lys Lys If J gIu Leu Leu Cys 

Oly ryr Oln Pro Leu Val Met Ar. Asp Ala If J Val Phe Asp Glu t'rl 

Glu Glu Phe Lys Pro Asp Ar. Tyr Val il^ Olu Thr Gly Ser 'oil Leu 

.au Asn Tyr Leu Tyr Trp Ser Asn Gly Pro Gin 0^ Gly ili Pro Ser 

Ala ser Asn Lys Gin Cys Ala Ala Lys Asp He Val Leu Thr Ala 

ser Leu Leu Val Ala Asp Leu Phe Leu Arg ^r Is'J xie Gly 

Asp ser Gly Ser lie Lys Ala Val Val Lys £f Lys "° 
485 49Q 

<210> 4 

<211> 480 

<212> PRT 

<213> Bell Pepper 

<400> 4 

M« 11. p„ lie 3„ ser Tto: «a Thr Pro U, 

ser Le„ Pro Vjl Ar, Lys lie Pro ol^ ^r Ty. oly Ph. Pro iL 
«ly Pro E.eu Trp A.p i,e„ Asp Tyr A» Trp Ph, oln ^Js Pro 
ASP Phe Phe s„ uy. Arg v.l Ly. ,yr A,„ v.l Phe Ar, 

Thr A.„ val Pro Pro Cys Phe Pro ^ Ph. ^„ Hy v.l p,„ ^„ 

val val ^la v,l J.„ ^p val s» Ph. 2. Hi, L» Phe Asp S2t 
Olu lie val Jlu Asn Val ^ ,i, ^ 

val Val ryr .hr Oly Asp M,. Arg J°! cy Ala Tyr Le„ it^ Thr Ser 
Glu Pro I,y. UU Thr Oln II. ^„ ^ ^ 

Ar| .,r s.r Lys Thr Trp V.l Pro T^ VaJ JJ^ oiu Asp Thr 
^ Ph. Oly Thr Ph. Olu ser A«, =er 1^ Ser .ys ..r Ala IS 

I-u Pro Ala «„ i,y, p,. pS A«. Ph. Ph. s„ Thr 

P-e Oly „a ASP pro s„ Ala Sel Pro Cl„ 11. .la Hi s„ Oly 

Phe Ala Tyr L«. ASP Ala Tg 1^ ^ i^ ij^ ^ 

|.r II. Oly val l^u Oln Pro oi. ol„ il. HI val Hi, s,r Ph. 
ser Tyr Pro Tyr Ph. 1^ v.l Ar, Oly oly ^ Olu Ly, „. fj," 

255 
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Phe 


Val 


Lys 


Ser 


Glu 


Ala 


Lys 


Glu 








260 










Phe 


Gin 


Leu 


Thr 


Glu 


Gin 


Glu 


Ala 






275 










280 


Glv 


Phe 


Aszi 


Ala 


Phe Gly Gly 


Phe 




290 










295 




Glv 


Aszi 


Leu 


Gly Asp Glu Lys Asn 


305 










^ ^ u 






Lys 


Glu 


Val 


Arg Glu 


JiJjf s 


Val 












325 








Glu 


Ser 


Val 


Lys 


Glu 


Met 


Glu 


Leu 








340 










Leu 


Axg 


Leu 


Ser 


Pro 


Pro 


Val 


Pro 






355 










360 


Asp 


Phe 


Met 


Leu 


Ser 


Ser 


His 


Asp 




370 










375 




Glu 


Leu 


Leu 


Cys 


Gly 


Tyr 


Gin 


Pro 


385 










390 






Phe 


Asp 


Glu 


Pro 


Glu 


Lys 


Phe 


Met 










405 








Gly 


Lys 


Glu 


Leu 


Leu 


Asn 


Tyr 


Leu 








420 










Gly 


Ser 


Pro 


Thr 


Glu 


, Ser 


Asn 


Lys 






435 










440 


Thr 


Leu 


Thr 


Ala 


Ser 


Leu 


lie 


Val 




450 










455 




Ser 


Val 


Ser 


Phe 


Ser 


Ser Gly Ser 



465 470 

<210> 5 
<211> 517 
<212> PRT 

<213> Arabidopsis . 
<400> 5 

Met Ala Ser lie Ser Thr Pro Phe 

1 5 
Val Arg Ser Lys Pro Leu Lys Phe 
20 

Ala Ser Gly Ser Glu Thr Pro Asp 

35 40 
Ser Lys Asp Leu Pro lie Arg Asn 

50 55 
lie Val Gly Pro lie Lys Asp Arg 
65 70 
Ala Glu Glu Phe Phe Lys Ser Arg 
85 

Tyr Arg Val Asn Met Pro Pro Gly 
100 

Val Val Ala Leu Leu Asp Gly Lys 
115 120 
Asp Lys Val Glu Lys Lys Asp Leu 

130 135 
Thr Glu Leu Thr Gly Gly Tyr Arg 
145 150 



Val 


Leu 


Thr 


Arg 


Ala 


Gin 


Thr 


Asp 


265 










270 






lie 


His 


Asn 


Leu 


Leu 


Phe 


He 


Leu 










285 








Thr 


lie 


Phe 


Leu 


Pro 


Thr 


Leu 


Leu 








300 










Ala 


Glu 


Met 


Gin 


Glu 


Lys 


Leu 


Arg 






315 










320 


Thr 


Asn 


Gin 


Glu 


Asn 


Leu 


Ser 


Phe 




330 










335 




Val 


Gin 


Ser 


Phe 


Val 


J. 


Glu 


Ser 


345 










350 






Ser 


Gin 


Tvr 


Ala 


Arg 


Ala 


Arg 


Lys 










365 








Ser 


Val 


TVTT 


Glu 


He 


Lys 


Lys 


Gly 








380 










Leu 


Val 


Met 


Lys 


Asp 


Pro 


Lys 


Val 






395 










400 


Leu 


Glu 


Arg 


Phe 


Thr 


Lys 


Glu 


Lys 




410 










415 




Phe 


Trp 


Ser 


Asn 


Gly 


Pro 


Gin Thr 


425 










430 






Gin 


Cys 


Ala 


Ala 


Lys 


Asp 


Ala 


Val 










445 








Ala 


Tyr 


He 


Phe 


Gin Lys 


Tyr 


Asp 








460 










Leu 


Thr 


Ser 


Val 


Lys 


Lys 


Ala 


Cys 






475 










480 




He 


Ser 




His 


Pro 


Lys 


Thr 




10 










15 




Arg 


Leu 


Val 


Thr 


Arg 


Pro 


lie 


Lys 


25 










30 






Leu 


Thr 


Val 


Ala 


Thr 


Arg 


Thr Gly 










45 








lie 


Pro Gly 


Asn 


Tyr Gly 


Leu 


Pro 








60 












Asp 


Tyr 


Phe 


Tyr 


Asp 


Gin Gly 






75 










80 


lie 


Arg 


Lys 


Tyr 


Asn 


Ser 


Thr 


Val 




90 










95 




Ala 


Phe 


He 


Ala 


Glu 


Asn 


Pro 


Gin 


105 










110 






Ser 


Phe 


Pro 


Val 


Leu 


Phe 


Asp Val 










125 








Phe 


Thr 


Gly 


Thr 


Tyr 


Met 


Pro 


Ser 








140 










lie 


Leu 


Ser 


Tyr 


Leu Asp 


Pro 


Ser 



155 160 
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Glu Pro ^y. Hi, ^ ^„ ^ 

S« se. ^ ^ I,S 

I-eu ... jjp 3er .eu 31„ Ly. Olu .he Pro x„„ Arg s,r c.y 

-K. »J| AT, Phe Gl„ AT, As= Ara L,„ L.u Phe «Xy s.r s„ 
Ph. I^u AT, ci„ gr AT, Arg Tyr Ly. Leu lJ2 A1. A,p Al. Pro 
Oly I^u xae Thr Trp v.l L,u Ph, As„ ^ Hi, Pro Leu L.u '41 
Thr o.y Leu Pro Ar, vel He CX. olu Leu «e Hi, ^ ?K 
.eu Pro Pro Ale Leu Val Ly, |er Asp Tyr ol„ Ar, Leu du Phe 
Leu AT, lie Ar, Oly Glu lie Leu Vel olu Al. Aep g| Leu oly He 
Ijr AT, Olu Olu Al. ™r Hie Ae„ Leu Leu Ph. US s.r Phe As„ 
Thr Trp Oly Oly ^| Lye He Leu Phe Pro Het V.1 Ly. Ar« lit 
Oly Pro Oly Oly Hie ol„ v.l Hie As„ ^ L^ «. ^35 ^ 

^ V.1 He Lys Ser Ae„ Oly oly Ol5 Leu ^r «et Oly 111 He olu 
Ly. «.j Olu Leu Thr Lye ser Val Vel Tyr olu Cys ^, 
HI Pro V.1 Thr Al. Olj Tyx oly Ar, Al. Ly, ^ Leu V.l He 
Olu ser Hi. Aep jj. ^= «0 

Oly TVr 01„ Pro L.u Al. Thr Ar, Aep Ly, He Phe A,p „. 
An. Olu Phe v.1 Pro Olu Ar, Phe v.1 Oly Olu Olu Oly X Ly, Leu 
L«. Ar, Hi, v.1 L,„ Trp s„ Asn Oly Pro Olu Thr ill Thr Pro Thr 
V.1 Oly A.„ Ly, ol„ cy, Al. oly Ly. A,p Phe ill v.l Leu Val Al. 
Arg L» Phe Val xi. olu He Ph. Ar, Ar. Asp ser Phe A,p 
Olu v.1 Oly Thr Ser Pro Leu oly Ser v.l A» Phe Ser III Leu 
Arg Lys Ala Ser Phe 510 

<210> 6 
<211> 1901 
<212> DNA 

<213> Lycopersicon eculenttun 
<400> 6 

l^T. ^^.It s^^^ c%-S^SS f^lSSS 



60 
120 
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cacatccaca 
tttatttccc 
aatgaattct 
gggcagctac 
ccaaaaaccg 
aacgaacgtg 
tttagatgtg 
gcttgttggg 
tctcgatact 
aagaggctca 
ttttgaagca 
cctctttaac 
cgccaattct 
cagcattggt 
ttttcttgtc 
agttttaagc 
tttgttcatt 
gggaaatctt 
agacaaagtc 
tgttcagtct 
aagagcaaga 
ggagcttctt 
tgaaaagttt 
gttttggtct 
taaagatatg 
ttctgtgagt 
ctgtgtagct 
ttttccttcc 
acttattttc 
tgaatctaag 



caagagctat 
tcaagcaatc 
gctcctctat 
ggattgccgt 
gaaaacttct 
ccgccgtgtt 
aaatcgtttt 
gatttcatgc 
tctgaaccta 
aaaacatggg 
gatctt:tcaa 
ttcttctccc 
ggctacatct 
gtccttcaac 
aaaggtaatt 
agggcacaaa 
cttgggttca 
ggagatgaga 
ggcgtaaatc 
tttgtttatg 
aaagatttta 
tgtggttatc 
gtgttggaga 
aatggtccac 
gttactctaa 
ttctcatctg 
gcaaaatcta 
tccccctatt 
tgttctgact 
ttataattac 



gaaatttctg 
gtatcccctt 
caactccggc 
tagtagggcc 
tcaccaagag 
ttccgttttt 
cgcatctgtt 
ccagtgttgt 
aacatgccca 
tgcctacact 
aatccaatiac 
tcactatcct 
ttcttgattc 
ctcttgaaga 
atgaaaaact 
cagagtttca 
atgcttttgg 
aaaacgcgga 
cagaaaactt 
aaacacttag 
aactgagttc 
agcctttagt 
gatttacaaa 
aaactgggag 
ctgcttcttt 
gttcactcac 
agcttgctag 
gtgtaacat:a 
tatgacta^t 
aaaaaaaaaa 



aaacaaacgt 
agcagatcga 
gccggttact 
aatcgcggat 
aatggaaaag 
cggtagtgtg 
tgatatggag 
ttatactgga 
gattaagaat: 
acttaaagag 
agcttctctt 
cggcgctgat 
atggctagct 
aattttagtc 
cgttcaattc 
actcacagaa 
tggcttctcc 
tiatgcaagag 
gagttttgaa 
gcttagtcca 
acatgattca 
tatgaaagat 
ggaaaaaggg 
acctactgaa 
gattgtggct 
atcCgtgaaa 
acatgttgat 
atttcacaaa 
gtctatcctc 
aaaaaaaaaa 



tacaatagac 
aaaaaaaaga 
ctgcccgtcc 
cga^tagact 
cacaagagca 
aatccaaatg 
attgtagaga 
gatatgcgtg 
ttittcacagg 
cttgatacaa 
cttcctgcac 
ccatctgtct 
attcaattag 
cattcttttg 
gtcaaaaatg 
caagaagcca 
attttcttgc 
aaactgagaa 
agtgttaaag 
ccagtaccaa 
gtttacgaaa 
ccaaaggtgt 
aaagaattgc 
tcaaacaagc 
tacattttcc 
aaggctagtt 
cttaagtctic 
taaaagttca 
etacaaatta 
a 



acacggcgta 
agataccgat 
gttcgattcc 
acttctggtt 
cggttttcag 
tggtggcggt 
aagctaatgt 
tttgtgctta 
atattctaaa 
tgtttacaac 
tccaaaaatt 
ccccagaaat 
cacctactgt 
catacccttt 
aagccaagga 
ttcataacct 
caactctttt 
aagaagtgag 
aaatggaact 
gtcaatatgc 
tcaagaaagg 
ttgatgaacc 
tgaattattt 
aatgtgctgc 
aaaagtacga 
gagttctggg 
ggtggctggc 
acatgagtgt: 
agttggaaca 



<210> 7 
<211> 476 
<212> PRT 

<213> Lycopersicon eculentum 
<220> 

<223> Synthetic Oligonucleotide 



<400> 7 






























Met 


Asn 


Ser 


Ala 


Pro 


Leu 


Ser 


Thr 


Pro 


Ala 


Pro 


Val 


Thr 


Leu 


Pro 


Val 


1 








5 










10 










15 




Arg 


Ser 


He 


Pro 


Gly 


Ser 


Tyr 


Gly 


Leu 


Pro 


Leu 


Val 


Gly 


Pro 


He 


Ala 








20 










25 










30 






Asp 


Arg 


Leu 


Asp 


Tyr 


Phe 


Trp 


Phe 


Gin 


Lys 


Pro 


Glu 


Asn 


Phe 


Phe 


Thr 






35 










40 










45 








Lys 


Arg 


Met 


Glu 


Lys 


His 


Lys 


Ser 


Thr 


Val 


Phe 


Arg 


Thr 


Asn 


Val 


Pro 




50 










55 










60 










Pro 


Val 


Phe 


Pro 


Phe 


Phe 


Gly 


Ser 


Val 


Asn 


Pro 


Asn 


Val 


Val 


Ala 


Val 


65 










70 










75 










80 


Leu 


Asp 


Val 


Lys 


Ser 


Phe 


Ser 


His 


Leu 


Phe 


Asp 


Met 


Glu 


He 


Val 


Glu 








85 










90 










95 




Lys 


Ala 


Asn 


Val 


Leu 


Val 


Gly Asp 


Phe 


Met 


Pro 


Ser 


Val 


Val 


Tyr 


Thr 








100 










105 










110 






Gly 


Asp 


Met 


Arg Val 


Cys 


Ala 


Tyr. 


Leu 


Asp 


Thr 


^er 


Glu 


Pro Lys 


His 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1901 
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*1. Glj II. 125 
Jhr Trp V.1 p., ^ ^ ^„ IJO 
Phe Olu Al. je„ 1.0 
I- OXn ... ^ p„ ^ 1- ^ ^ 

^ Pro .er v.1 s» Pro «1„ xl. J?! ^„ ser Gly T„ p,. 
A=P ser TTp l^ 205 

|u Gin P.O Olu Olu Xle ^„ v.l Hi. s« 1^2 «. Tyr P.o P., 
P^ 1^ V.1 oiy ^ Olu gu Olu P., v,l .y, 

Olu !,„ Olu val r^u ,x ^ 

OXU Oln Olu «a Xlo Hi. ^„ , ^= 3,0 ^ 

Jl. Oly P.e s„ Zl. p. P,o .K, ^ i?= ^ ^„ 

Jg Olu Ly, ;u„ Al, oi„ Olu Uys ^u IJ" x.„ olu V.1 A,, 

A=P .v. V.1 01. v.1 P„ Olu A=„ .eu SI P.„ olu Sor Val l^l 
Olu HO. Olu ^u v.1 ol„ s.. P.O v.1 ^ Olu T.„ ^ ^ 
Pro Pro v.1 P„ s.. ol„ ^ .1 ^ go 

S.r sej Hi. Asp ser v.1 ^ olu II. x,„ x.„ oly IS L.U L.U Cy, 
Oly IVr Ol. Pro I..U V.1 «et I.y. ASP p„ ^„ 

Olu x.ys PKe V.1 olu Ar^ p^ , J« ^„ 400 
L.U As„ ^ J.U p,. 3„ ^„ JIO ^ „5 ^ 

Glu Ser Asn Lys Gin Cvs Ala at- rtZ , . 430 

435 JJg Asp Met Val Thr l^u Thr Ala 

ser jeu Xle Val Ala Tyr lie Phe Gin .ys Tyr Asp 1*1 Val Ser Phe 
ser ser Gly Ser z.eu Thr Ser Val l,ys l,ys Ala til 

475 

<210> 8 
<211> 1712 
<212> DNA 

<213> Cucuinis sativus 
<400> 8 

gpiJS ir^J^^t l^'^t '^It^T, ..a«c.t.c ,0 

=c«=.«t, t„o=occ== „cS,to?S ttt^,^ =2SS«»J l™^. UO 
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ggtttgccgc tgctgggatc catcggtgac cgactcgatt actactggtt tcaaggaccc 240 

ga^aagttct tcaggactcg tiatggagaag aatcgaagca cggttttcag aaccaatgtt 300 

cctccgtctt tccctttcat ctccgctgat cccagagtag ttgcggttct ggattgcaag 360 

tcctttgcgc atctatttga catggaaatc gtggagaaaa ataatgttct ggtcggtgat 420 

ttcatgccca gcatcagttt caccggaaac atgagagtgt gtgcttattt ggatacctcg 480 

gaacccaatc actccaaggt aaagaacttc attacagaca ttctacggcg gagctcaagg 540 

atatggatat cggagttgga atccaaccta tcgacgatgt gggacggcat tgaattagaa 600 

atggcaaaga acaaacaatc gggtttcaga aactttctcc aaccagctct tttcaatttc 660 

ttctccaaaa ctctcgccgg cgccgacact gcaaaatcac cggaagtggc aaaatccgga 720 

tacatagacg tcataatctg gctcggcctc cagctcgttc ccaccatcca catcggcgtt 780 

ctccagcccc ttgaagaaat attccttcac tctttccgct tacccttttt ccccattgct 840 

tcccgctacc aaagactcta cgatttcttc caaaaagaag gggcagaagt tgttgagcga 900 

ggtgtcaccg agttcgggtt aacaaaggaa gaagccattc acaatctcat cttcaccatg 960 

ggcttcaacg cctatggtgg cttcagtctc ttcttccctg ttctacttga ccgtatactc 1020 

aatgataaaa ccggtctaca acaaagaatc ctcaaggaag tcaggtcaaa atccagctct 1080 

ggtctgacct tcgagtcagt caaggagatg gacctcatct actccatcgt ctacgagaca 1140 

ctccggcttg accccccagt tccctcccaa tatgccagag ccagaagatt tcaagctaac 1200 

ttcctacaat tcaacgtata acatcaagaa aggggaactg ctttgtgggt atcagccgct 1260 

ggtgatgaga gatccagagg tgttcgatga accggaggcg tttaatccag accgattccg 1320 

gggagagaaa ggggcagcgc tgctggatta tttattttgg tcgaatgggc cgcaaacggg 13 80 

gacaccgagc gagaagaaca aacagtgtgc tgggaaggat ttagtggtgc tgacgggggt 1440 

ggtgtttgtg gct:t:acatat: ttaaaaggta tgattcgatt gctggggaag gaggttccat 1500 

tacagctttt caaagggcta attgaaaagt gaaagt:gaat tatata^aca ttatgtatcg 1560 

tttattgaga gatgcacttt tcttgttcaa tggcttcttt cttatgtatg agtttggagc 1620 

caaaatggga agatttggaa tgaataaatc aataaaatca agatttiactt taccttgtaa 1680 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 1712 



<210> 9 
<211> 1813 
<212> ONA 

<213> Cucumis sativus 



<400> 9 

ccattctctt 

cacagcaatg 

cttcqccttc 

cgaattcttc 

accgggcccc 

tcccatcctc 

gccctccttg 

agagcacact 

catccctctg 

cgacaaaaat 

tttccgttta 

gttcgattta 

tttctctgtt 

gagtcgttac 

agcagagaaa 

attcaacgca 

cggtggcgag 

agggggactg 

agctctgagg 

gattcagagc 

gccgtttgct 

gttcgtggga 



ccaacgtgaa 
gcttcttcct 
ctcggtccca 
cgttcccgaa 
ttcatctcct 
ttcgacacca 
tccttcaccg 
gttctcaaac 
tttcgaagct 
aagatcgctg 
ttctccgatg 
tggcttgggc 
tttgaagatc 
aggaagcttt 
caggggatag 
tacgggggaa 
gatctccacc 
actttctccg 
atcgaaccgc 
cacgattctt: 
actaaagatc 
gaagaagggg 



gataagaaac 
cccctgaact 
tcaaagaccg 
ttaccaaata 
ccgattccag 
ccaaagtcga 
gcggcattcg 
gcctctttct 
ccttgtctga 
atttcaactc 
gaacccctga 
ttcaacttgc 
tcattattca 
atiaaagcgtit: 
acagagagaa 
tgaaagtcct: 
gCaaactggc 
ccttggagaa 
cggtgccgtt 
gttitcaagat 
cgaagatt^t 
agaagcCt^t 



cttttgttta 
tcctctcaaa 
ttacgattac 
caactccacc 
agtcgttgtc 
gaaacgcaac 
cacctgtgct 
ctcctttctc 
gatgtttgtt 
gattagtgat 
ttcgacatta 
cccattgget 
caccattccc 
ttactcctcc 
agcatgtcac 
ttttcccact 
ggaggaagtg 
aatigagtctg 
ccagtacggg 
caaaaaaggg 
taaggactcg 
gaagtatgtit: 



ctttttgtac 
cccattcccg 
ttctatttcc 
gtcttccacg 
ctcctcgatg 
attctcgacg 
tatttggacc 
gcttctcacc 
aagcttgaag 
gccgtgtcgt 
gctgctgatg 
tccattggcc 
ctgcctttct 
tctggctcat 
aatttagtgt 
atactgaaat 
aggacaaccg 
ctgaagticgg 
aaagcgaagg 
gagacgaett 
gagaagtitcg 
tactggtcaa 



gatcacaggt 
gtggctatgg 
aaggtagaga 
ccaacatgcc 
ccctcagttt 
gaacttiacat: 
catcggaaac 
atgacaggtt 
ataaactcgc 
ttgattatgt 
gacctggaat 
ttcccaaaat 
tcccagtcaa 
ttctagacga 
ttcttgctgg 
gggtcggcac 
tgaaggaaga 
tcgtgtacga 
aggat:atcgt 
ttggttatca 
tgggcgatag 
atgagcggga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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sst? i^sss^i it^tttrct 

tato^?«^== caaagtaatt tcctcgtgat ccgcgtcatt 
taaISS atcttatttt t.Lftttcl 
aaaaaaSaai aaa * ^caaatatca acttcataga 

<210> 10 
<211> 477 
<212> PRT 

<213> Cucmnis sativus 
<220> 

<223> Synthetic Oligonucleotide 
<400> 10 

Met Ala Ser Ser Ser Pro Glu Leu Pro Leu Lys 

Tyr Gly Phe Pro Phe Leu Gly Pro He lys Asp 

Tyr Phe Gin Gly Arg Asp Glu Phe Phe Arg Ser 

Asn ser Thr Val Phe His Ala Isn Met Pro Pro 

ser ASP ser Arg Val Val vll Leu Leu Asp Ala 

Leu Phe ASP Thr Thr Lys Val Glu Lys Arg 

Tyr Met Pro Ser Leu Ser Phe Thr Gly G^y He 

Leu Asp Pro Ser Glu Thr Glu His 0^ Val Leu 

ser Phe Leu Ala Ser His His Arg Phe He 

Ser Leu Ser Glu Met Phe Val Lys Leu Glu Asp 

Asn Lys He Ala Asp Phe Asn Ser He Ser A^p 

Tyr val Phe Arg Leu Phe Ser Asp Gly J^J p^o 

Ala Asp Gly Pro Gly Met Phe Asp Leu Trp Leu 

Pro ^eu Ala ser He Gly Leu pJ2 Lys He Phe 

Leu He He His Thr He Pro Leu Pro Phe Phe 

Tyr Arg Lys Leu T^ Lys Ala Phe Tyr Ser III 

Glu Ala Glu Lys Gin Gly He Asp Arg g!2 Lys 

val Phe Leu Al^ Gly Phe Asx, Ala ^ Gly Gly 
-^'^ 280 ^ ^ 



aagaatctgg 
acgttcactg 
accagagcca 
gaataaagga 
acatgacaca 
tttaatttga 
tttttctcaa 
caaaaaaaaa 



tggtgatgat 
tggacgtcgc 
ccgcttcggt 
agggtgttcg 
tcattgattt 
gatctcattg 
ctatgctcca 
aaaaaaaaaa 



Pro He 
Arg Tyr 



Pro Gly Gly 
15 

Asp Tyr Phe 
30 

Thr Lys Tyr 



Arg He 
45 

Gly Pro Phe He Ser 
60 

Leu Ser 



He Leu 

Arg Thr 

Lys Arg 
125 
Pro Leu 
140 

Lys Leu 

Ala Val 

Asp Ser 

Gly Leu 
205 
Ser Val 
220 

Pro Val 
Gly Ser 
Ala Cys 



Met Lys 
285 



Phe Pro He 
80 

Asp Gly Thr 
95 

Cys Ala Tyr 
110 

Leu Phe Leu 

Phe Arg Ser 

Ala Asp Lys 
160 

Ser Phe Asp 

175 
Thr Leu Ala 
190 

Gin Leu Ala 

Phe Glu Asp 

Lys Ser Arg 
240 

Phe Leu Asp 

255 
His Asn Leu 
270 

Val Leu Phe 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1813 
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Pro 


Tnr 


He 


Leu 


Lys 


Trp 


Val 


Gly 


















Lys 


Leu 


Ala 


Glu 


Glu 


Val 


Arg 


Tnr 


















rnr 


Fne 


Ser 


Ala 


Leu 


Glu 


T « ram 

Lys 


nec 




















AJLa 


Leu Arg 


Xle 


Glu 


Pro 


Pro 








340 










T 1 wfm 


oXU 


Asp 


He 


vax 


xxe 


(sxn 


Ser 






355 












Liys 


Gly 


Glu 


Thr 


Xle 


Phe 


Gly 


Tyr 




J /O 










375 




Lys 


Xle 


Phe 


Lys 


Glu 


Ser 


Glu 


Lys 












390 






Glu 


Glu 


Gly Glu 


Lys 


Leu 


Leu 


Lys 










405 








Glu 


Thr 


Val 


Glu 


Pro 


Thr 


Ala 


Glu 








420 










Leu 


Val 


Val Met Met Gly Arg 


Xle 






435 










440 


Tyr 


Asp 


Thr 


Phe 


Thr 


Val 


Asp 


Val 




450 










455 




Val 


Lys 


Phe 


Lys 


Ser 


Leu 


Thr 


Arg 


465 










470 






<210> 11 












<211> 482 












<212> PRT 












<213> Banana 










<400> 11 












Met 


Ala 


Met 


Met 


Tit? 


Ser 


Ser 


Ala 


1 








5 








Pro 


Thr 


Arg 


Pro 


Xle 


Pro 


Gly 


Ser 








20 








Leu 


Lys 


Asp 


Arg 


Leu Asp 


Tyr 


Phe 






35 












Phe 


Arg 


Ser 


Arg 


Met 


Ala 


Thr 


His 




50 










55 




Met 


Pro 


Pro 


Thr 


Phe 


Pro 


Phe 


Phe 


65 










70 






Thr 


Val 


Leu 


Asp 


Cys 


Thr 


Ser 


Pne 










85 








Val 


Glu 


. Lys 


Lys 


Asn 


Xle 


Leu 


xxe 








100 










Phe 


Thr 


Gly 


Asp Thr Arg Val 


Val 






115 










120 


Asp 


His 


Ala 


Arg 


Val 


Lys 


Ser 


Phe 




130 










135 




Ala 


Lys 


Thr 


Trp Val 


Ser 


Ser 


Phe 


145 










150 






Ala 


Thr 


Xle 


Glu 


Gin Gly 


Xle 


Ala 










165 








Gly 


Pro 


Leu 


Gin Lys 


Cys 


Xle 


Phe 



180 



Thr Gly 


Gly 


Glu Asp 


Leu 


His 


Arg 








300 










xnr 


vax 


Lys 


Glu 


Glu 


Gly Gly Leu 
















320 


Ser 


Leu 

*a 
330 


Leu 


Lys 


Ser 


vax 


Val 
335 


Tyr 


Val 


Pro 


Fne 


Gin Tyr 


Gly 


Lys 


Ala 


345 










OCA 






His 


Asp 


Ser 


Cys 


Phe 
365 


Lys 


Xle 


Lys 


Gin 


Pro 


pne 


Ala 
380 


Thr 


Lys 


Asp 


Pro 


Phe 


Val 


Gly 


Asp Arg 


Phe 


Val 


Gly 






395 










400 


Tyr 


Val 


Tyr 


Trp 


Ser 


Asn 


Glu Arg 




410 










415 




Asn Lys 


Gin 


Cys 


Pro 


Gly 


Lys 


Asn 


425 










430 






Xle 


Val 


Val 


Glu 


Phe 
445 


Phe. 


Leu 


Arg 


Ala Asp 


Leu 


Ala 


Leu 


Gly 


Pro Ala 








460 










Ala 


Thr 


Ala 
475 


Ser 


Val 









Ser 


Ala 
10 


Thr 


Ala 


Val 


Thr 


Thr 
15 


Leu 


Tyr Gly 


Pro 


Pro 


Leu 


Val 


Gly . Pro 


25 










30 






Trp 


Phe 


Gin 


Gly 


Pro 
45 


Glu 


Thr 


Phe 


Lys 


Ser 


Thr 


Val 
60 


Phe 


Arg 


Thr 


Asn 


Val 


Gly 


Val 


Asp 


Pro 


Arg Val Val 






75 










80 


Ser 


Ala 


Leu 


Phe 


Asp 


Leu 


Gin 


Val 




90 








95 




Gly Asp 


Tyr 


Met 


Pro 


Ser 


Leu 


Ser 


105 










110 






Val 


Tyr 


Leu 


Asp 


Pro 
125 


Ser 


Glu 


Pro 


Cys 


Leu 


Gin 


Leu 


Leu 


Arg Arg Gly 








140 










Leu 


Ser 


Asn 
155 


Leu 


Asp 


Val 


Met 


Leu 
160 


Lys 


Asp 


Gly 


Ser 


Ala 


Gly Leu 


Phe 




170 










175 




Ala 


Phe 


Leu 


Cys 


Lys 


Ser 


Xle 


Xle 


185 








190 
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OIV Al, .jp V.1 p ^ v,l Gly 01„ A=„ Oa^ Ph. Val 

H.t teu ..p Txp JJ. ^„ 

p. ... XX. P.„ P.O OX„ .X. 22 His se. P.. P„ 

L.U P„ p,. p,. S^^^^m,^^^ 

v.a clu ^ ojj ^0 ^ 

3iy ..u |„ ^p J2 ^„ ^ ^^^^ j,o 

Ph. J.„ M. p.. , J ^ J£ ^„ ^ 

-J XX. ox. ^ _ 300 ^ 

vax ^ V.X ... s.. .X. cx„ S Pro s.r P., |!° 
V.X oxu ^ vax ^ ^ III ^ 

X^ »^ p„ p,„ ^45 3^ 

T.» x-u S«: ^p „ J» 3JS 

|S CV. T„ OXn ^ vax . ^° 

A« Pro .X„ P.. ^ »3 «„ 

X^ ^ vax Ph. TTp ..r js„ Pro «u T>„ ox. Pro 

T>» PTC „. JJ; I-y. »»P xvr vax ill oxu ^r 

"a g. x^ ^ ^ „. ^ AT, ^ p,, 

^ «. A^ AXa XX, s.r VaX *r Ar. AX. Ar, GXu 

Trp Glu *'5 

<210> 12 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 12 

cggttcctct gcgcctctct cgccggcg 

<210> 13 28 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Synthetic Oligonucleotide 
<400> 13 

gcggaaccgg aggactaaaa cgcagc 

<210> 14 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 14 

ccatcctaat acgactcact atagggc 

<210> 15 
<211> 25 
<2i2> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 15 

cttggcgtag ttcctcagcc tcttg 

<210> 16 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 16 

actcactata gggctcgagc ggc 

<210> 17 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 17 

cgggatccat gttgttgaga acgatggcgg eg 

<210> 18 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



PCT/USOO/05311 



26 



27 



25 



23 



32 



13 - 
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<220> 

<223> Synthetic Oligonucleotide 
<400> 18 

caatctccgg cgttctcgtc g 

<210> 19 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<221> misc_feature 
<222> (1)...(17) 
<223> n = A,T,C or G 



17 



<400> 19 

atnccnggnw sntaygg 

<210> 20 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<221> misc^feature 
<222> (1)._(17) 
<223> n = A,T,C or G 

<400> 20 

carccnytng argarat 

<210> 21 " 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<221> misc^feature 
<222> (1) . . . (17) 
<223> ii = A,T,C or G 

<400> 21 

atytcytcna mggytg 

<210> 22 ^'^ 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Synthetic Oligonucleotide 

<221> misc_feature 
<222> (1) . . . (20) 
<223> n = A,T,C or G 

<400> 22 

ggnttyaayg cntwyggngg 20 

<210> 23 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Synthetic Oligonucleotide 

<221> misc_feature 
<222> (1) . . . (20) 
<223> n = A,T,C or G 

<400> 23 

ccnccrsang crttraancc 20 

<210> 24 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<221> misc_feature 
<222> (1) . . . (18) 
<223> n = A,T,C or G 

<400> 24 

taycarccny tngtnatg 18 

<210> 25 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<221> misc_feature 
<222> (1) . . . (18) 
<223> n = A,T,C or G 

<400> 25 

catnacnam ggytgrta .18 
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<210> 26 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<221> misc^feature 
<222> {1)...(17) 
<223> n = A,T,C or G 

<400> 26 

gtnttygayg anccnga 

<210> 27 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 

<221>. inisc_f eature 
<222> (1)..,(17) 
<223> n = A,T,C or G 

<400> 27 

tcnggntcrt craanac 

<210> 28 
<211> 17 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Synthetic Oligonucleotide 

<221> misc_feature 
<222> (1) . . . (17) 
<223> n = A,T,C or G 

<400> 28 

ccngtytwng gnccrtt 

<210> 29 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<221> miscfeature 
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<222> (1) . . . (22) 
<223> n = A,T,C or G 

<400> 29 

cyttngcngc rcaytgyttr tt 22 

<210> 30 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 30 

cggtggagat cctcgccacc ggtgccgacc c 31 

<210> 31 
<211> 34 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Synthetic Oligonucleotide 
<400> 31 

cttccttcac ggttgtcctc acttcctccg ccag 34 

<210> 32 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 32 

tccagcagcg ctgccccttt ctctccccgg 30 

<210> 33 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 33 

cactgtttgt tcttctcgct cggtgtcccc g . 31 

<210> 34 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
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31 



<220> 

<223> Synthetic Oligonucleotide 
<400> 34 

gggtcggcac cggtggcgag gatctccacc g 

<210> 35 
<211> 34 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 35 

ctggcggagg aagtgaggac aaccgtgaag gaag 

<210> 36 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 36 

ccggggagag aaaggggcag cgctgctga 

29 

<210> 37 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 37 

cggggacacc gagcgagaag aacaaacagt g 

<210> 38 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 38 

gacttggtac tggtggacta agcctaagtg tttc 

34 

<210> 39 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Synthetic Oligonucleotide 
<400> 39 

ggctgataac cacaaagaag ctcccctttc 30 

<210> 40 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 40 

gaaacactta ggcttagtcc accagtacca agtc 34 

<210> 41 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 41 

gaaaggggag cttctttgtg gttatcagcc 30 

<210> 42 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 42 

cgggatccat ggcttcttcc tcccctgaac ttc 33 

<210> 43 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic Oligonucleotide 
<400> 43 

tgccgaccca tttcagtata gtggg 25 

<210> 44 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Synthetic Oligonucleotide 
<400> 44 

ttcacaccat tcccctgcct ttcttccc 

28 
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